GitHub - wumch/xlearn: High performance, easy-to-use, and scalable machine learning (ML) package, including linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM) for Python and CLI interface.

What is xLearn?

xLearn is a high performance, easy-to-use, and scalable machine learning package that contains linear model (LR), factorization machines (FM), and field-aware factorization machines (FFM), all of which can be used to solve large-scale machine learning problems. xLearn is especially useful for solving machine learning problems on large-scale sparse data. Many real world datasets deal with high dimensional sparse feature vectors like a recommendation system where the number of categories and users is on the order of millions. In that case, if you are the user of liblinear, libfm, and libffm, now xLearn is your another better choice.

Get Started! (English)

Get Started! (中文)

Performance

xLearn is developed by high-performance C++ code with careful design and optimizations. Our system is designed to maximize CPU and memory utilization, provide cache-aware computation, and support lock-free learning. By combining these insights, xLearn is 5x-13x faster compared to similar systems.

Ease-of-use

xLearn does not rely on any third-party library and users can just clone the code and compile it by using cmake. Also, xLearn supports very simple Python and CLI interface for data scientists, and it also offers many useful features that have been widely used in machine learning and data mining competitions, such as cross-validation, early-stop, etc.

Scalability

xLearn can be used for solving large-scale machine learning problems. First, xLearn supports out-of-core training, which can handle very large data (TB) by just leveraging the disk of a PC. In addition, xLearn supports distributed training, which scales beyond billions of example across many machines by using the Parameter Server framework.

How to Contribute

xLearn has been developed and used by many active community members. Your help is very valuable to make it better for everyone.

Please contribute if you find any bug in xLearn.
Contribute new features you want to see in xLearn.
Contribute to the tests to make it more reliable.
Contribute to the documents to make it clearer for everyone.
Contribute to the examples to share your experience with other users.
Open issue if you met problems during development.

Note that, please post iusse and contribution in English so that everyone can get help from them.

Contributors (rank randomly)

For Enterprise Users and Call for Sponsors

If you are enterprise users and find xLearn is useful in your work, please let us know, and we are glad to add your company logo here. We also welcome you become a sponsor to make this project better.

What's New

2019-10-13 Andrew Kane add Ruby bindings for xLearn!
2019-4-25 xLearn 0.4.4 version release. Main update:
- Support Python DMatrix
- Better Windows support
- Fix bugs in previous version
2019-3-25 xLearn 0.4.3 version release. Main update:
- Fix bugs in previous version
2019-3-12 xLearn 0.4.2 version release. Main update:
- Release Windows version of xLearn
2019-1-30 xLearn 0.4.1 version release. Main update:
- More flexible data reader
2018-11-22 xLearn 0.4.0 version release. Main update:
- Fix bugs in previous version
- Add online learning for xLearn
2018-11-10 xLearn 0.3.8 version release. Main update:
- Fix bugs in previous version.
- Update early-stop mechanism.
2018-11-08. xLearn gets 2000 star! Congs!
2018-10-29 xLearn 0.3.7 version release. Main update:
- Add incremental Reader, which can save 50% memory cost.
2018-10-22 xLearn 0.3.5 version release. Main update:
- Fix bugs in 0.3.4.
2018-10-21 xLearn 0.3.4 version release. Main update:
- Fix bugs in on-disk training.
- Support new file format.
2018-10-14 xLearn 0.3.3 version release. Main update:
- Fix segmentation fault in prediction task.
- Update early-stop meachnism.
2018-09-21 xLearn 0.3.2 version release. Main update:
- Fix bugs in previous version
- New TXT format for model output
2018-09-08 xLearn uses the new logo:

2018-09-07 The Chinese document is available now!
2018-03-08 xLearn 0.3.0 version release. Main update:
- Fix bugs in previous version
- Solved the memory leak problem for on-disk learning
- Support TXT model checkpoint
- Support Scikit-Learn API
2017-12-18 xLearn 0.2.0 version release. Main update:
- Fix bugs in previous version
- Support pip installation
- New Documents
- Faster FTRL algorithm
2017-11-24 The first version (0.1.0) of xLearn release !

Name		Name	Last commit message	Last commit date
Latest commit History 1,335 Commits
R-package		R-package
demo		demo
doc		doc
docker		docker
gtest		gtest
img		img
python-package		python-package
scripts		scripts
src		src
windows		windows
.gitignore		.gitignore
.travis.yml		.travis.yml
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
README.md		README.md
appveyor.yml		appveyor.yml
build-travis.sh		build-travis.sh
build.bat		build.bat
build.sh		build.sh
makeR.sh		makeR.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What is xLearn?

Performance

Ease-of-use

Scalability

How to Contribute

Contributors (rank randomly)

For Enterprise Users and Call for Sponsors

What's New

About

Releases

Packages

Languages

License

wumch/xlearn

Folders and files

Latest commit

History

Repository files navigation

What is xLearn?

Performance

Ease-of-use

Scalability

How to Contribute

Contributors (rank randomly)

For Enterprise Users and Call for Sponsors

What's New

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages