ATM: A distributed, collaborative, scalable system for automated machine learning

NCJ Number
308341
Journal
IEEE Transactions on Big Data, Volume: 2017 IEEE International, Dated: 2017, Pages: 151-162
Date Published
January 2024
Annotation

The authors present Auto-Tuned Models (ATM), a system for automated machine learning; they describe the purpose of their research and demonstrate the effectiveness of their system compared to human-generated solutions.

Abstract

In this paper, the authors present Auto-Tuned Models, or ATM, a distributed, collaborative, scalable system for automated machine learning. Users of ATM can simply upload a dataset, choose a subset of modeling methods, and choose to use ATM's hybrid Bayesian and multi-armed bandit optimization system. The distributed system works in a load-balanced fashion to quickly deliver results in the form of ready-to-predict models, confusion matrices, cross-validation results, and training timings. By automating hyperparameter tuning and model selection, ATM returns the emphasis of the machine learning workflow to its most irreducible part: feature engineering. The authors demonstrate the usefulness of ATM on 420 datasets from OpenML and train over three million classifiers. Their initial results show ATM can beat human-generated solutions for 30 percent of the datasets, and can do so in 1/100th of the time. (Published Abstract Provided)
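The combination of model selection and hyperparameter tuning described in the abstract can be illustrated with a minimal sketch. This is not ATM's code or API: the method names, the toy scoring functions, and the helpers `ucb1_select` and `run_search` are all hypothetical. It assumes a UCB1 multi-armed bandit chooses which modeling method to try next, and it uses plain random sampling as a stand-in for the Bayesian tuner that would propose hyperparameters within a method.

```python
import math
import random


def ucb1_select(stats, total_pulls):
    """Return the arm with the highest UCB1 index; untried arms go first."""
    for name, (pulls, _) in stats.items():
        if pulls == 0:
            return name

    def index(name):
        pulls, reward_sum = stats[name]
        mean = reward_sum / pulls
        return mean + math.sqrt(2 * math.log(total_pulls) / pulls)

    return max(stats, key=index)


def run_search(evaluators, budget, seed=0):
    """Spend `budget` evaluations across methods, tracking the best result.

    `evaluators` maps a (hypothetical) method name to a function that takes
    one hyperparameter in [0, 1) and returns a score in [0, 1].
    """
    rng = random.Random(seed)
    stats = {name: (0, 0.0) for name in evaluators}  # name -> (pulls, reward sum)
    best = (None, None, -math.inf)  # (method, hyperparameter, score)

    for t in range(budget):
        name = ucb1_select(stats, t)
        # Assumption: random sampling stands in for a Bayesian tuner here.
        param = rng.random()
        score = evaluators[name](param)
        pulls, reward_sum = stats[name]
        stats[name] = (pulls + 1, reward_sum + score)
        if score > best[2]:
            best = (name, param, score)

    return best, stats


# Toy stand-ins for cross-validated scores of three modeling methods.
TOY_EVALUATORS = {
    "method_a": lambda p: 0.60 + 0.10 * p,
    "method_b": lambda p: 0.80 + 0.10 * p,
    "method_c": lambda p: 0.40 + 0.10 * p,
}

if __name__ == "__main__":
    best, stats = run_search(TOY_EVALUATORS, budget=200)
    print("best method:", best[0])
    print("evaluations per method:", {k: v[0] for k, v in stats.items()})
```

In this toy setup the best result always comes from `method_b`, because its lowest possible score exceeds the highest score of the other two methods. In a real system each evaluation would train and cross-validate a classifier, and the evaluations could be handed out to separate workers.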

Date Published: January 22, 2024