Arrière

Presentation

A review of machine learning commands in Stata: Performance and usability evaluation

Giovanni Cerulli

7 September 2023

Session

This presentation provides a comprehensive survey reviewing machine learning (ML) commands in Stata.

I systematically categorize and summarize the available ML commands in Stata and evaluate their performance and usability for different tasks such as classification, regression, clustering, and dimension reduction. I also provide examples of how to use these commands with real-world datasets and compare their performance. This review aims to help researchers and practitioners choose appropriate ML methods and related Stata tools for their specific research questions and datasets, and to improve the efficiency and reproducibility of ML analyses using Stata. I conclude by discussing some limitations and future directions for ML research in Stata.

Speaker

Giovanni Cerulli