Skip to content
View szilard's full-sized avatar

Organizations

@user2014 @DataScienceLA

Block or report szilard

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Adaptive and automatic gradient boosting computations.

R 67 11 Updated Aug 20, 2022
HTML 4 Updated Aug 1, 2021

Most recent/important talks given at conferences/meetups

15 1 Updated Nov 27, 2020

GBM intro talk (with R and Python code)

HTML 17 4 Updated May 6, 2021

A curated list of gradient boosting machines (GBM) resources

10 2 Updated May 12, 2019

Advanced workshop on XGBoost with Tianqi Chen in Santa Monica, June 2, 2016

R 26 7 Updated Nov 21, 2016

Szilard Pafka's short bio (to go with conference talk abstracts)

2 Updated Aug 1, 2021
R 4 Updated May 30, 2019

Kaggle scripts: R vs pydata + most popular R and Python packages for Machine Learning

R 11 3 Updated Apr 13, 2017

Tuning GBMs (hyperparameter tuning) and impact on out-of-sample predictions

HTML 21 3 Updated Sep 11, 2017

Code (and other materials) for an introductory talk/workshop on GBMs (developed originally for an R-Ladies Meetup)

HTML 6 4 Updated Jul 15, 2019

Machine Learning #1 and #2 courses at CEU Master of Science in Business Analytics

HTML 21 58 Updated Feb 2, 2019

Machine Learning #1 and #2 courses at CEU Master of Science in Business Analytics

HTML 38 44 Updated Mar 28, 2021

Machine Learning in Production in 1 Slide

1 Updated May 17, 2017

Some thoughts on how to use machine learning in production

72 11 Updated May 17, 2017

Materials for STATS 418 - Tools in Data Science course taught in the Master of Applied Statistics at UCLA

HTML 135 64 Updated Jun 10, 2017

Compare the scoring speed of several open source machine learning libraries.

R 21 4 Updated Jun 19, 2017

Performance of various open source GBM implementations

HTML 217 28 Updated Jun 20, 2024

GBM multicore scaling: h2o, xgboost and lightgbm on multicore and multi-socket systems

HTML 20 1 Updated May 13, 2018

Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more

C++ 20,788 6,777 Updated Oct 25, 2023

useR! 2016 Tutorial: Machine Learning Algorithmic Deep Dive http://user2016.org/tutorials/10.html

Jupyter Notebook 401 204 Updated Mar 5, 2018

Inspired by David Donoho's "50 Years of Data Science" (2015) paper, I'm releasing here a course proposal draft I wrote in 2009 for a possible course of "data science".

9 Updated Apr 6, 2016

A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Python scikit-learn, H2O, xgboost, Spark MLlib etc.) of the top machine learning al…

R 1,877 334 Updated Sep 16, 2022

Materials for a short introductory/intermediate Data Science course taught in the MSc in Business Analytics program at the Central European University

HTML 33 13 Updated Sep 8, 2017

A minimal benchmark of various tools (statistical software, databases etc.) for working with tabular data of moderately large sizes (interactive data analysis).

R 90 17 Updated Jul 25, 2017

Data Science in 1 Slide

4 Updated May 17, 2017

Size of datasets used for analytics based on 10 years of surveys by KDnuggets.

HTML 16 2 Updated Nov 18, 2015

Latency numbers every data scientist should know (aka the pyramid of analytical tasks) - the order of magnitude of computational time for the most common analytical tasks (SQL-like data munging, li…

20 4 Updated Apr 13, 2017

Quick informal survey at the Los Angeles Machine learning meetup about tools used for machine learning.

51 6 Updated Jun 28, 2015

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

C++ 26,544 8,740 Updated Jan 31, 2025
Next