The Institute for Ethical AI & Machine Learning


THE ML ENGINEER 🤖
Issue #22

This week in Issue #22:

A conversation on practical NLP, ML reproducibility infrastructure, Berkeley on Serverless, Face detection in Python, Human-Centric ML Infrastructure, tutorial on CNNs, lambda frameworks, upcoming ML conferences, data science / ML engineering jobs and more 🚀.

Support the ML Engineer!

Forward the email, or share the online version on 🐦 Twitter, 💼 LinkedIn and 📕 Facebook!

If you would like to suggest articles, ideas, papers, libraries, jobs, events or provide feedback just hit reply or send us an email to a@ethical.institute! We have received a lot of great suggestions in the past, thank you very much for everyone's support!

A conversation on practical NLP

A great conversation with spaCy co-creator Ines Montani on the "This Week in ML (TWiML)" podcast. In this session, Ines dives into the challenges and trends in NLP, as well as the need for "industry-ready" tools that provide more than just research capabilities. They also cover other interesting areas, such as the word vector algorithm the spaCy team created, which they named Language Modelling with Approximate Outputs (aka LMAO).

"I don't like notebooks" @ ICLR19

A really insightful introduction to the need for reproducibility infrastructure, presented at ICLR 2019. The presentation covers some of the tools and techniques for tackling reproducibility, and offers insight into several other challenges that are starting to appear in the ML space, such as explainability. The video for 2018 is available, as well as the slides for 2019.

A Berkeley view on serverless

After Berkeley's successful report a few years back that demystified the concept of cloud computing, they have put together a new report that aims to do the same for serverless technologies. They provide a high-level framework that differentiates serverless from conventional cloud computing through three key characteristics: 1) Decoupling of computation and storage; they scale separately and are priced independently. 2) The abstraction of executing a piece of code instead of allocating resources on which to execute that code. 3) Paying for the code execution instead of paying for resources you have allocated to executing the code.
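
The third characteristic can be illustrated with a small cost comparison. The following is a minimal sketch and is not taken from the Berkeley report: all prices, the `Workload` type and both function names are hypothetical placeholders chosen purely for illustration.

```python
from dataclasses import dataclass

# Hypothetical placeholder prices, NOT real provider rates.
PRICE_PER_GB_SECOND = 0.00002   # serverless: charged per GB-second of execution
PRICE_PER_REQUEST = 0.0000002   # serverless: charged per invocation
PRICE_PER_SERVER_HOUR = 0.10    # serverful: charged per allocated server hour


@dataclass
class Workload:
    """A hypothetical monthly workload description."""
    requests_per_month: int
    seconds_per_request: float
    memory_gb: float


def serverless_cost(w: Workload) -> float:
    """Cost when you pay only for code execution (characteristic 3)."""
    gb_seconds = w.requests_per_month * w.seconds_per_request * w.memory_gb
    return gb_seconds * PRICE_PER_GB_SECOND + w.requests_per_month * PRICE_PER_REQUEST


def serverful_cost(servers: int, hours_per_month: float = 730.0) -> float:
    """Cost when you pay for allocated resources, whether busy or idle."""
    return servers * hours_per_month * PRICE_PER_SERVER_HOUR


if __name__ == "__main__":
    low_traffic = Workload(requests_per_month=1_000_000,
                           seconds_per_request=0.2,
                           memory_gb=0.5)
    print(f"serverless: {serverless_cost(low_traffic):.2f}")
    print(f"serverful:  {serverful_cost(servers=1):.2f}")
```

With these placeholder rates, a low-traffic workload costs far less when billed per execution, while an allocated server is billed for every hour regardless of traffic. With no requests at all, the serverless cost in this sketch drops to zero.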

Face detection in Python OpenCV

A great hands-on tutorial from Towards Data Science covering core fundamentals of computer vision, with worked examples that leave you with a functional face detection algorithm by the end. The post covers face detection with Haar Cascade Classifiers using OpenCV, Histogram of Oriented Gradients using Dlib, and Convolutional Neural Networks.

Human-Centric ML Infrastructure

A great deep dive by Ville Tuulos from Netflix, providing key knowledge on core MLOps concepts, as well as the need for ML operations frameworks. Ville also introduces a tool that is being used internally at Netflix to manage their machine learning infrastructure.

A tutorial on Convolutional NNs

The MNIST handwritten digit classification problem is a standard benchmark in computer vision and deep learning. Machine Learning Mastery provides a hands-on tutorial that shows how to tackle this challenge using convolutional neural networks.

MLOps = Featured OS Libraries

This week's edition focuses on Function-as-a-Service frameworks, which fall under our Responsible ML Principle #4. The four featured libraries this week are:

OpenFaaS - Serverless functions framework with RESTful API on Kubernetes
Fission - Serverless functions as a service framework on Kubernetes
Hydrosphere ML Lambda - Open source model management cluster for deploying, serving and monitoring machine learning models and ad-hoc algorithms with a FaaS architecture
Hydrosphere Mist - Serverless proxy for Apache Spark clusters

If you know of any libraries that are not in the "Awesome MLOps" list, please do give us a heads up or feel free to open a pull request!

MLConf = Conferences & Events

We feature conferences that have core ML tracks (primarily in Europe for now) to help our community stay up to date with great events coming up.

Technical Conferences

PyCon + PyData Florence [02/05/2019] - PyCon X comes this year with a PyData focus in Florence, Italy.

AI Conference Beijing [18/06/2019] - O'Reilly's signature applied AI conference in Asia in Beijing, China.

RAAIS 2019 [28/06/2019] - The Research and Applied AI Summit in London, UK.

Data Natives [21/11/2019] - Data conference in Berlin, Germany.

ODSC Europe [19/11/2019] - The Open Data Science Conference in London, UK.

spaCy IRL [05/07/2019] - spaCy NLP's first F2F conference in Berlin, Germany.

EurNLP [11/10/2019] - Europe's NLP research conference (pronounced "Your NLP") in London, UK.

Khipu AI [11/11/2019] - Latin American Meeting in Artificial Intelligence in Montevideo, Uruguay.

Business Conferences

World Summit AI Americas [10/04/2019] - Large scale AI summit in Montreal, Canada.
- Come join our panel on AI Ethics and Tools.

AI Expo Global [19/04/2019] - Global conference on artificial intelligence in London, UK.
- Come join us at our talk on AI orchestration at scale.

Predictive Analytics World [18/11/2019] - Conference for Business AI in Berlin, Germany.

Big Data LDN 2019 [13/11/2019] - Conference for strategy and tech on big data in London, UK.

MLJobs = Jobs & Careers

We showcase Machine Learning Engineering jobs (primarily in London for now) to help our community stay up to date with great opportunities that come up. It seems that the demand for data scientists continues to rise!

Leadership Opportunities

Algorithmia is hiring for a VP of Engineering in Seattle, USA
Fractal Labs is hiring for a VP of Engineering in London
Distributed is hiring for a VP of Engineering in London
FactMata is hiring for a Head of Machine Learning in London
Brainpool.ai is hiring for a Head of Machine Learning in London, UK
Cytora is hiring for a Data Science Director in London

Mid-level Opportunities

Proportunity is hiring for a Senior Machine Learning Engineer in London
Twitter is hiring for a Senior Machine Learning Engineer in London
Atlas ML is hiring for a Lead NLP Engineer in London
StreetBees is hiring for a Senior Data Scientist in London
Expedia is hiring for a Principal Data Scientist in London
QuantumBlack is hiring for a Senior Machine Learning Engineer in London
Tractable is hiring for a Senior Deep Learning Engineer

Junior Opportunities

Seldon is hiring for a Machine Learning / Data Engineer in London
Migacore is hiring for a Machine Learning Engineer in London
CloudNC is hiring for a Machine Learning Engineer in London
Babylon Health is hiring for a Machine Learning Engineer in London
Chattermill is hiring for a Machine Learning Engineer in London