Hello! I'm Denis Moura, a Senior Data Engineer with over 4 years of experience in building scalable and resilient data pipelines and platforms. My passion lies in working with data and solving complex data challenges using modern technologies. Skilled in Python, SQL, AWS, and data engineering best practices, I also bring a strong focus on cost optimization and actionable insights. You can find me on LinkedIn and explore my work here on GitHub.
Senior Data Engineer with 4+ years of experience in designing scalable data pipelines and cloud-native solutions. Skilled in Python, SQL, and AWS, with expertise in migrating systems to Snowflake, developing ETL pipelines, and optimizing data analytics. Proven leader, interfacing client needs with engineering teams to deliver robust, user-focused data solutions.
- Programming Languages: Python, SQL, JavaScript
- Big Data Technologies: Snowflake, Iceberg, Spark, Kafka
- Cloud Platforms: AWS (S3, Glue, Lambda, Step Functions, Athena), GCP (BigQuery, Storage)
- Data Orchestration: Airflow, Step Functions
- DevOps & Tools: Terraform, Docker, Kubernetes, Git, GitHub Actions
- Visualization Tools: PowerBI, Sigma Computing, Looker Studio, Metabase
- Data Modeling & Governance: Data Lakes, Lake Formation, Data Quality Validation
Dexian Disys (U.S. based nearshore β remote)
2022 β Present
- Engineered a scalable data migration platform using Airflow, reducing AWS operational costs by 30%.
- Migrated 200+ views to Snowflake, improving query performance by 40%.
- Automated MS Teams-to-Snowflake ingestion, ensuring data privacy and workflow efficiency.
- Delivered actionable insights through dashboards built in Sigma Computing and PowerBI.
- Designed secure data upload tools for daily team use, handling gigabytes of data seamlessly.
- Led a team of 4 experienced Data Engineers, allowing a charismatic example-based leadership. Interfaced final users to data owners and the engineering team to develop robust, client-first data solutions.
Varsomics β Hospital Israelita Albert Einstein (Brazil β remote)
2021 β 2022
- Migrated 70TB genomic data to AWS Data Lake, enabling faster analytics with robust governance.
- Created a Delta Lake structure with efficient data governance via Lake Formation and Athena.
- Developed dashboards to monitor genomics pipelines, simplifying internal data analysis processes.
PickCells (Brazil β Local)
2020 β 2021
- Automated microscopy processes integrating robot movements and camera focus with Python.
- Led an on-premises to AWS cloud migration project with real-time pipelines using Airflow.
- Conducted COVID-19 network analysis using Python and advanced data science techniques.
- Ph.D. in Applied Biology (Bioinformatics), Universidade Federal de Pernambuco, 2022
- M.Sc. in Applied Biology (Neuroscience), Universidade Federal de Pernambuco, 2018
- B.Sc. in Biology, Universidade Federal de Pernambuco, 2015
- Astronomer Certification for Apache Airflow Fundamentals (2024)
- AWS Partner: Security Governance at Scale (2021)
- Confluent Apache Kafka Fundamentals (2021)
- Data Migration Platform: Designed a scalable platform in Airflow, optimizing ETL workflows and cutting costs by 30%.
- Genomic Data Lake: Led a genomic data migration project, enhancing compliance and reducing query times by 25%.
- Microscopy Automation: Created a Python-based microscopy solution, boosting research capabilities.
- Compliance Tool: Built a React/NodeJS tool for tracking and managing compliance-sensitive data in Snowflake.
Feel free to reach out via Email or connect with me on LinkedIn.
Thank you for visiting my GitHub profile! Explore my repositories, and donβt hesitate to connect for collaboration opportunities or questions.