forked from dmlc/xgboost

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow


sagunb/xgboost


eXtreme Gradient Boosting

Gitter chat for developers: https://gitter.im/dmlc/xgboost

XGBoost is an optimized distributed gradient boosting library designed to be highly efficient, flexible, and portable. It implements machine learning algorithms under the Gradient Boosting framework. XGBoost provides parallel tree boosting (also known as GBDT or GBM) that solves many data science problems quickly and accurately. The same code runs on major distributed environments (Hadoop, SGE, MPI) and can solve problems beyond billions of examples. XGBoost is part of the DMLC projects.
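To make the idea of tree boosting concrete, here is a minimal sketch in plain Python. It is not XGBoost's implementation or API: it assumes a single numeric feature, squared-error loss, and depth-1 trees (stumps), and the names `fit_stump`, `fit_boosted`, and `predict_boosted` are hypothetical helpers introduced only for illustration. Each round fits a stump to the current residuals (the negative gradient of the squared loss) and adds a scaled copy of it to the ensemble.

```python
# Minimal gradient boosting sketch with depth-1 regression trees (stumps).
# Illustrative only: this is NOT XGBoost's implementation or API.
# Assumptions: one numeric feature, squared-error loss, hypothetical helper names.


def fit_stump(xs, residuals):
    """Return (threshold, left_value, right_value) minimizing squared error."""
    best = None
    for threshold in sorted(set(xs)):
        left = [r for x, r in zip(xs, residuals) if x <= threshold]
        right = [r for x, r in zip(xs, residuals) if x > threshold]
        if not left or not right:
            continue  # a split must put at least one point on each side
        left_mean = sum(left) / len(left)
        right_mean = sum(right) / len(right)
        error = sum((r - left_mean) ** 2 for r in left)
        error += sum((r - right_mean) ** 2 for r in right)
        if best is None or error < best[0]:
            best = (error, threshold, left_mean, right_mean)
    if best is None:
        # No valid split (all x values equal): predict the mean on both sides.
        mean = sum(residuals) / len(residuals)
        return (xs[0], mean, mean)
    return best[1:]


def predict_stump(stump, x):
    threshold, left_value, right_value = stump
    return left_value if x <= threshold else right_value


def fit_boosted(xs, ys, n_rounds=50, learning_rate=0.3):
    """Fit an additive model: the mean of ys plus a sum of scaled stumps."""
    base = sum(ys) / len(ys)
    preds = [base] * len(ys)
    stumps = []
    for _ in range(n_rounds):
        # For the loss 0.5 * (y - p)^2 the negative gradient is the residual.
        residuals = [y - p for y, p in zip(ys, preds)]
        stump = fit_stump(xs, residuals)
        stumps.append(stump)
        preds = [p + learning_rate * predict_stump(stump, x)
                 for p, x in zip(preds, xs)]
    return base, stumps, learning_rate


def predict_boosted(model, x):
    base, stumps, learning_rate = model
    return base + learning_rate * sum(predict_stump(s, x) for s in stumps)


def mean_squared_error(model, xs, ys):
    return sum((predict_boosted(model, x) - y) ** 2
               for x, y in zip(xs, ys)) / len(ys)


# Usage: fit a toy quadratic and report the training error.
xs = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0]
ys = [1.0, 4.0, 9.0, 16.0, 25.0, 36.0]
model = fit_boosted(xs, ys)
print(mean_squared_error(model, xs, ys))
```

The sketch keeps the learning rate below 1 so that each stump only partially corrects the residuals, which is the usual shrinkage idea in boosting. A production library adds deeper trees, regularization, and parallel split finding on top of this loop.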

Contents

What's New

Version

  • Current version xgboost-0.6 (brick)

Features

  • Easily accessible through the CLI, Python, R, and Julia
  • It's fast! Benchmark numbers comparing xgboost, H2O, Spark, and R: benchm-ml numbers
  • Memory efficient: handles sparse matrices and supports external memory
  • Accurate prediction, used extensively by data scientists and Kagglers: highlight links
  • Distributed version runs on Hadoop (YARN), MPI, SGE, etc., and scales to billions of examples

Bug Reporting

Contributing to XGBoost

XGBoost has been developed and used by a group of active community members. Everyone is more than welcome to contribute; contributions make the project better and more accessible to more users.

License

© Contributors, 2015. Licensed under an Apache-2 license.
