catboost / catboost

A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
下载
介绍

Website |Documentation |Tutorials |Installation |Release Notes

GitHub licensePyPI versionConda VersionGitHub issuesTelegram

CatBoost is a machine learning method based on gradient boosting over decision trees.

Main advantages of CatBoost:  - Superior quality when compared with other GBDT libraries on many datasets.  - Best in class prediction speed.  - Support for both numerical and categorical features.  - Fast GPU and multi-GPU support for training out of the box.  - Visualization tools included.

Get Started and Documentation

All CatBoost documentation is available here.

Install CatBoost by following the guide for the * Python package * R-package * command line

Next you may want to investigate: * Tutorials* Training modes and metrics* Cross-validation* Parameters tuning* Feature importance calculation* Regular and staged predictions

If you cannot open documentation in your browser try adding yastatic.net and yastat.net to the list of allowed domains in your privacy badger.

Catboost models in production

If you want to evaluate Catboost model in your application read model api documentation.

Questions and bug reports

Help to Make CatBoost Better

  • Check out open problems and help wanted issues to see what can be improved, or open an issue if you want something.

  • Add your stories and experience to Awesome CatBoost.

  • To contribute to CatBoost you need to first read CLA text and add to your pull request, that you agree to the terms of the CLA. More information can be found in CONTRIBUTING.md

  • Instructions for contributors can be found here.

News

Latest news are published on twitter.

Reference Paper

Anna Veronika Dorogush, Andrey Gulin, Gleb Gusev, Nikita Kazeev, Liudmila Ostroumova Prokhorenkova, Aleksandr Vorobev "Fighting biases with dynamic boosting". arXiv:1706.09516, 2017.

Anna Veronika Dorogush, Vasily Ershov, Andrey Gulin "CatBoost: gradient boosting with categorical features support". Workshop on ML Systems at NIPS 2017.

License

© YANDEX LLC, 2017-2019. Licensed under the Apache License, Version 2.0. See LICENSE file for more details.


代码语言分布

C++ 82.8%
Python 10.2%
Cuda 3.3%
Scala 0.8%
Makefile 0.6%
R 0.5%
Other 1.8%
相关推荐
tensorflow / tensorflow

一个面向所有人的开源机器学习框架

tensorflow
2021-01-18

opencv / opencv

Open Source Computer Vision Library

opencv
2021-01-26

keras-team / keras

Deep Learning for humans

keras-team
2021-01-19