The AI Co-Scientist Making ML More Accessible for Your Research
AI Co-scientist that Democratizes Machine Learning for Researchers
In our previous post, we introduced Curie: an open-sourced AI co-scientist that automates experimentation and accelerate the journey from idea to validation. Today, we’re thrilled to introduce Curie’s AutoML feature, designed to help researchers rapidly test hypotheses and unlock insights from their valuable data.
We built this AutoML feature after personally seeing how even highly-capable researchers—in fields like biology, materials science, and chemistry—struggle to apply machine learning to their work. They often have valuable data, but lack the advanced ML knowledge to fully explore its potential. Despite their expertise, the technical barriers make ML feel out of reach, and this is the exact problem that we wanted to solve. Curie can create end-to-end machine learning solutions for non-ML experts, and we have evidence of it already making a positive impact with our early adopters.
Introducing Curie’s New Feature: Your Automated ML Solution
Curie goes beyond traditional parameter or architecture search based AutoML, and takes over all the tedious but critical work performed by ML practitioners.
From data preprocessing, to model and hyperparameter selection, to training and deploying recipes, which all require deep expertise. It’s a vast search space to find the best performing solution, often involving iterative experiments and specialized intuition to fine-tune all the different components in the pipeline. Curie aims to automate this. The goal is to democratize access to powerful ML capabilities for researchers.

All you need to do is input your research question and the dataset path to Curie:
python3 -m curie.main -q 'E.g. How to improve my prediction accuracy on my dataset. Checkout <paper.pdf> for the background information.' \
--dataset_dir <abs_path_to_your_dataset> \
--workspace_name <[optional] abs_path_to_your_codebase> \
--task_config curie/configs/mle.json
From there, it will work to generate the optimal ML solution for your specific problem given the budget.
Curie in Action: Demonstrated Performance
We’ve benchmarked Curie on several ML tasks—including several from MLE-Bench—to demonstrate its capabilities:
- Identifying Melanoma (Cancer) in Images of Skin Lesions
- Predict the severity level of diabetic retinopathy based on retinal images
- Histopathologic Cancer Detection
- Stock Price Prediction
More AutoML use cases can be found here.
Case Study: Skin Cancer Detection Challenge
Here’s a preview of an auto-generated report from Curie. You can scroll through it to see the detailed analysis and insights:
Curie demonstrated some impressive capabilities in the skin cancer detection challenge:
- It managed to train a model achieving a remarkable 0.99 AUC (top 1% performance) using 2 hours. Moreover, the agent intelligently explored a variety of models with early stopping strategies on dataset subsets to quickly gauge potential to efficiently navigate the vast search space of possible models.
- It incorporated data augmentation (random rotation, color jitter, …) to enhance model generalization.
- It provided detailed analysis on performance versus system trade-offs, offering insights for efficient model deployment strategies.
Despite the strong performance, there are areas where Curie can evolve.
-
The current model architectures explored were relatively basic, and the specific ML problem, while important, is a well-established one. It’s possible the task wasn’t as challenging as some newer, more complex problems. The true test will be its performance on more diverse, real-world datasets.
-
Looking ahead, a crucial area for improvement lies in enhancing the agent’s hypothesis generation capabilities. We’re keen to see it explore the search space beyond established empirical knowledge, which will be key to unlocking even higher levels of accuracy and tackling more novel challenges.
Join Us & Contribute to Open Science
And this is where we need your help! We believe in open science, and your usage of Curie will help us make it better and support more scientists. We also welcome contributions to our repo, we can’t keep up with all the latest research by ourselves!
@misc{kon2025curierigorousautomatedscientific,
title={Curie: Toward Rigorous and Automated Scientific Experimentation with AI Agents},
author={Patrick Tser Jern Kon and Jiachen Liu and Qiuyi Ding and Yiming Qiu and Zhenning Yang and Yibo Huang and Jayanth Srinivasa and Myungjin Lee and Mosharaf Chowdhury and Ang Chen},
year={2025},
eprint={2502.16069},
archivePrefix={arXiv},
primaryClass={cs.AI},
}