Skip to content
View mouradap's full-sized avatar
🏠
Working from home
🏠
Working from home

Block or report mouradap

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
mouradap/README.md

Welcome to my GitHub Profile!

Profile Banner

πŸ‘‹ About Me

Hello! I'm Denis Moura, a Senior Data Engineer with over 4 years of experience in building scalable and resilient data pipelines and platforms. My passion lies in working with data and solving complex data challenges using modern technologies. Skilled in Python, SQL, AWS, and data engineering best practices, I also bring a strong focus on cost optimization and actionable insights. You can find me on LinkedIn and explore my work here on GitHub.


πŸ‘¨β€πŸ’Ό Professional Resume

Summary

Senior Data Engineer with 4+ years of experience in designing scalable data pipelines and cloud-native solutions. Skilled in Python, SQL, and AWS, with expertise in migrating systems to Snowflake, developing ETL pipelines, and optimizing data analytics. Proven leader, interfacing client needs with engineering teams to deliver robust, user-focused data solutions.


Skills and Technologies

  • Programming Languages: Python, SQL, JavaScript
  • Big Data Technologies: Snowflake, Iceberg, Spark, Kafka
  • Cloud Platforms: AWS (S3, Glue, Lambda, Step Functions, Athena), GCP (BigQuery, Storage)
  • Data Orchestration: Airflow, Step Functions
  • DevOps & Tools: Terraform, Docker, Kubernetes, Git, GitHub Actions
  • Visualization Tools: PowerBI, Sigma Computing, Looker Studio, Metabase
  • Data Modeling & Governance: Data Lakes, Lake Formation, Data Quality Validation

Professional Experience

Lead Data Engineer

Dexian Disys (U.S. based nearshore – remote)
2022 – Present

  • Engineered a scalable data migration platform using Airflow, reducing AWS operational costs by 30%.
  • Migrated 200+ views to Snowflake, improving query performance by 40%.
  • Automated MS Teams-to-Snowflake ingestion, ensuring data privacy and workflow efficiency.
  • Delivered actionable insights through dashboards built in Sigma Computing and PowerBI.
  • Designed secure data upload tools for daily team use, handling gigabytes of data seamlessly.
  • Led a team of 4 experienced Data Engineers, allowing a charismatic example-based leadership. Interfaced final users to data owners and the engineering team to develop robust, client-first data solutions.

Data Engineer

Varsomics – Hospital Israelita Albert Einstein (Brazil – remote)
2021 – 2022

  • Migrated 70TB genomic data to AWS Data Lake, enabling faster analytics with robust governance.
  • Created a Delta Lake structure with efficient data governance via Lake Formation and Athena.
  • Developed dashboards to monitor genomics pipelines, simplifying internal data analysis processes.

Software Engineer

PickCells (Brazil – Local)
2020 – 2021

  • Automated microscopy processes integrating robot movements and camera focus with Python.
  • Led an on-premises to AWS cloud migration project with real-time pipelines using Airflow.
  • Conducted COVID-19 network analysis using Python and advanced data science techniques.

Education

  • Ph.D. in Applied Biology (Bioinformatics), Universidade Federal de Pernambuco, 2022
  • M.Sc. in Applied Biology (Neuroscience), Universidade Federal de Pernambuco, 2018
  • B.Sc. in Biology, Universidade Federal de Pernambuco, 2015

Certifications

  • Astronomer Certification for Apache Airflow Fundamentals (2024)
  • AWS Partner: Security Governance at Scale (2021)
  • Confluent Apache Kafka Fundamentals (2021)

Projects and Achievements

  • Data Migration Platform: Designed a scalable platform in Airflow, optimizing ETL workflows and cutting costs by 30%.
  • Genomic Data Lake: Led a genomic data migration project, enhancing compliance and reducing query times by 25%.
  • Microscopy Automation: Created a Python-based microscopy solution, boosting research capabilities.
  • Compliance Tool: Built a React/NodeJS tool for tracking and managing compliance-sensitive data in Snowflake.

πŸ“« Get in Touch

Feel free to reach out via Email or connect with me on LinkedIn.


Thank you for visiting my GitHub profile! Explore my repositories, and don’t hesitate to connect for collaboration opportunities or questions.

Pinned Loading

  1. blood_cell_classification_keras_fastapi_react blood_cell_classification_keras_fastapi_react Public

    This is a complete service of blood cell image classification using a Convolutional Neural Network, with an FastAPI backend and ReactJS web app

    Jupyter Notebook 5 2

  2. covid19stats covid19stats Public

    A Covid-19 statistics tracking app distributed through docker

    Python

  3. MIT-hackathon MIT-hackathon Public

    Forked from iaradsouza1/MIT-hackathon

    Jupyter Notebook

  4. python_algorithms python_algorithms Public

    This repo stores Python implementations of shortest path algorithms, such as Dijkstra and A*.

    Python