ACAT 2017

Name: ACAT 2017
Start: 2017-08-21T07:45:00-07:00
End: 2017-08-25T18:00:00-07:00
Location: University of Washington, Seattle

21–25 Aug 2017

US/Pacific timezone

Speeding up prediction performance of the boosting decision trees-based learning models.

24 Aug 2017, 14:00

20m

107 (Alder Hall)

Oral, Track 2: Data Analysis - Algorithms and Tools

Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))

Many machine learning algorithms produce computationally complex models, and further gains in the quality of such models usually come at the cost of longer prediction times. Yet these high-quality models are often needed in settings with limited resources (memory or CPU time).
This article discusses how to trade model quality for prediction speed for models built with a novel boosted trees algorithm called CatBoost. The idea is to combine two approaches: training fewer trees and uniting trees into huge cubes. The proposed method allows a Pareto-optimal reduction of the computational complexity of the decision tree model with regard to the quality of the model. In the considered example, the number of lookups was decreased from 5000 to only 6 (speedup factor of 1000), while the AUC score of the model was reduced by less than one per mil.
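The idea of uniting trees into a cube can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes oblivious trees (the same split on every node of a level), and the data layout and the names tree_predict, build_cube and cube_predict are hypothetical, chosen only for this illustration. Several trees are merged into one table indexed by per-feature bin numbers, so a prediction needs one table lookup instead of one lookup per tree.

```python
from bisect import bisect_left
from itertools import product

# Hypothetical layout: an oblivious tree is (splits, leaves), where splits is a
# list of (feature_index, threshold) pairs, one per level, and leaves holds
# 2 ** len(splits) values.

def tree_predict(tree, x):
    """Evaluate one oblivious tree: one comparison per level, one leaf lookup."""
    splits, leaves = tree
    index = 0
    for feature, threshold in splits:
        index = (index << 1) | int(x[feature] > threshold)
    return leaves[index]

def build_cube(trees):
    """Merge trees into a single table indexed by per-feature bin numbers."""
    thresholds = {}
    for splits, _ in trees:
        for feature, threshold in splits:
            thresholds.setdefault(feature, set()).add(threshold)
    features = sorted(thresholds)
    grid = [sorted(thresholds[f]) for f in features]

    table = {}
    for bins in product(*(range(len(g) + 1) for g in grid)):
        # One representative point per cell; every tree is constant inside a cell.
        point = {}
        for f, g, b in zip(features, grid, bins):
            point[f] = g[b] if b < len(g) else g[-1] + 1.0
        table[bins] = sum(tree_predict(t, point) for t in trees)
    return features, grid, table

def cube_predict(cube, x):
    """Predict with one table lookup; bin = number of thresholds below x[f]."""
    features, grid, table = cube
    bins = tuple(bisect_left(g, x[f]) for f, g in zip(features, grid))
    return table[bins]

# Toy ensemble of three small trees over three features (made-up values).
trees = [
    ([(0, 0.5), (1, 1.0)], [0.1, 0.2, 0.3, 0.4]),
    ([(1, 2.0), (0, 0.5)], [-0.1, 0.0, 0.1, 0.2]),
    ([(2, -1.0)], [0.05, -0.05]),
]
cube = build_cube(trees)
x = [0.7, 1.5, -2.0]
assert cube_predict(cube, x) == sum(tree_predict(t, x) for t in trees)
```

In this sketch the table size is the product of (number of thresholds + 1) over the features used, so memory grows quickly as more trees are merged. That is one way to see why merging has to be balanced against model size and quality.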

Mr Egor Khairullin (Moscow Institute of Physics and Technology, Yandex School of Data Analysis (RU))
Andrey Ustyuzhanin (Yandex School of Data Analysis (RU))

AndreyUstyuzhanin-DecisionTensor-v2.pdf

Video

paper.pdf
