Skip to content
View bmservilha's full-sized avatar

Block or report bmservilha

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
HTML 188 35 Updated Feb 24, 2025

System design patterns for machine learning

2,545 282 Updated Oct 7, 2021

A large (+2500) collection of color maps for Python

Python 296 9 Updated Feb 21, 2025

A Desktop App for YouTube Music

TypeScript 4,274 461 Updated Feb 20, 2025

Command line and webapp for retrosynthetic disconnections, molecular complexity and synthetic accessibility metrics

Roff 14 2 Updated Aug 14, 2024

A tool for building feature stores.

Python 296 37 Updated Mar 6, 2025

The best repository showing why SMOTE and resampling methods might not be the answer for imbalanced data problems

35 2 Updated Mar 2, 2025

8 Lessons, Kick-start Your Cybersecurity Learning.

HTML 4,870 601 Updated Feb 13, 2025

Spark: The Definitive Guide's Code Repository

Scala 2,931 2,816 Updated Aug 26, 2020

Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews

Jupyter Notebook 110 98 Updated May 19, 2024

pyiron - an integrated development environment (IDE) for computational materials science.

Jupyter Notebook 386 49 Updated Jan 14, 2025

A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.

Python 2,046 93 Updated Sep 21, 2024

Apache Spark - A unified analytics engine for large-scale data processing

Scala 40,693 28,538 Updated Mar 7, 2025

O'Reilly Book: [Data Algorithms with Spark] by Mahmoud Parsian

Python 213 92 Updated Jun 26, 2023

Jornada engenharia de dados 2025

Python 535 134 Updated Feb 27, 2025

A docker compose file to deploy a production ready Metabase instance

46 12 Updated Dec 25, 2024

Open Drug Discovery Toolkit

Python 432 122 Updated Dec 13, 2022

Python suite for optimization of stationary points on ground- and excited states PES and determination of reaction paths.

Python 107 36 Updated Mar 8, 2025

The AMLSim project is intended to provide a multi-agent based simulator that generates synthetic banking transaction data together with a set of known money laundering patterns - mainly for the pur…

Python 276 84 Updated Apr 14, 2023

Figures and code examples from Bayesian Analysis with Python (third edition)

Jupyter Notebook 180 57 Updated Jan 22, 2025

The repo contains the main topics carried out in my master's thesis on operational risk. In particular, it is described how to implement the so called Loss Distribution Approach (LDA), which is con…

R 7 2 Updated Mar 4, 2021

Source for book "Feature Engineering A-Z"

HTML 137 12 Updated Mar 7, 2025

Protein-Ligand Benchmark Dataset for Free Energy Calculations

Python 166 16 Updated Jul 29, 2024

Applied Data Science for Credit Risk

HTML 103 32 Updated Mar 6, 2025

😎 A curated list of awesome MLOps tools

Python 4,354 602 Updated Nov 29, 2024

Machine Learning Engineering Open Book

Python 13,102 799 Updated Mar 9, 2025

Passo a passo para instalar no linux/wsl ubuntu

9 1 Updated Jan 13, 2024

✨ A Pydantic to PySpark schema library

Python 72 11 Updated Mar 7, 2025

PiML (Python Interpretable Machine Learning) toolbox for model development & diagnostics

Jupyter Notebook 1,242 126 Updated Oct 28, 2024
Next