|
|
|
|
|
|
Support the ML Engineer!
If you would like to suggest articles, ideas, tutorials, libraries or provide feedback just hit reply or send us an email to a@ethical.institute!
|
|
|
|
|
|
|
|
|
|
Feature engineering is still one of the most powerfull areas of machine learning. In this great Kaggle kernel, Feature Labs data scientist Will Koehrsen provides a comprehensive overview of how to approach the end to end data science workflow using the Home Credit Default Risk Competition dataset.
|
|
|
|
|
|
|
|
|
Great post by Dillon from Paperspace on continuous integration and deployment for AI & machine learning. This was a highly discussed topic in 2018, which will certainly see interesting inovations in 2019. This post breaks down ML CI/CD into the areas of 1) data, 2) hardware, 3) training steps and 4) retraining/online.
|
|
|
|
|
|
|
MLOps = ML Operations
This edition we are focusing on feature engineering, and more specifically the automation side of it. The machine learning feature engineering libraries we're showcasing this week are:
- auto-sklearn - Framework to automate algorithm and hyperparameter tuning for sklearn
- TPOT - Automation of sklearn pipeline creation (including feature selection, pre-processor, etc)
- tsfresh - Automatic extraction of relevant features from time series
- Featuretools - An open source framework for automated feature engineering
|
|
|
|
|
|
|
|