What's the Headache with Data For ML❓
What is data for ML?
In the industry this is also called: "model-ready data". The key difference between data for ML vs. others is this "readiness", which is typically prepared through preprocessing pipelines, feature transformations and engineering. These pipelines can be part of the models (sklearn preprocessing), or ETL steps (Spark, DBT, Airflow).
In …
Keep reading with a 7-day free trial
Subscribe to The MLnotes Newsletter to keep reading this post and get 7 days of free access to the full post archives.