Logistical Concerns for Deep Learning
Deep Learning
Overview
This week we talked about non-deep learning tasks that are necessary for effective modeling.
- Data cleaning and storage
- Pandas, Polars, and Databases
- Model evaluation and test set construction
- Structure in the data and information leakage
- Case Study: Workflow to predict viral infection potential
- Separating data prep from modeling
- Using cross validation folds stratified with respect to species
- How do we think about predicting cases that were not observed?
- Example pipeline
- Tracking Hyperparameters and Experiments
- General workflow
- KerasTuner, Ray, and Ax
- Example in Ax
Meeting Notes
- The slides from today can be found here.