The Institute for Ethical AI & Machine Learning

Subscribe to the Machine Learning Engineer Newsletter

Receive curated articles, tutorials and blog posts from experienced Machine Learning professionals.

THE ML ENGINEER 🤖

Issue #40

This week in Issue #40:

Tricking machine learning Malware classifiers
A new paradigm for ML deployment
Five must-know graph algorithms
Survey in fairnes and bias in ML
Google's OSS differential privacy library
Open source differential privacy libraries
AI conferences
ML jobs
+ more 🚀

Forward the email, or share the online version on 🐦 Twitter, 💼 Linkedin and 📕 Facebook!

If you would like to suggest articles, ideas, papers, libraries, jobs, events or provide feedback just hit reply or send us an email to a@ethical.institute! We have received a lot of great suggestions in the past, thank you very much for everyone's support!

Tricking ML Classifiers

The topic of cybersecurity in machine learning has seen an increase in activity in the community due to its critical nature around production systems. This blog post covers a fascinating competition that took place at DEFCON, where participants were tasked with tricking a ML classifier trained to detect malware. In this article the author provides an insight on how the challenge was applied together with the techniques used to succeed.

ML deployment paradigm

Production machine learning systems have proven that the nuanced challenges that are faced when deploying machine learning require a new paradigm. O'Reilly's Mike Loukides does a fantastic job in his latest article to provide an overview to the topic of machine learning deployment, together with insights on how this challenge is currently being tackled.

Five must-know graph algorithms

Although most of the carefully curated datasets that you may come across online may be on relational or key-value store, there has been an ever-increasing interest on graph datasets, as most of the data we interact with on a regular basis will tend to have more complex, and often grap-like structures. This article provides a comprehensible and non-exhaustive list of graph algorithms to get acquaintanced with - these include an intuitive explanation, insights of where they may be relevant and an example code implementation.

Survey fairness and bias in ML

As larger and more critical datasets (and decisions) become part of the machine learning end-to-end production workflow, the challenges with statistical and societal bias/fairness become more complex. This survey provides a very comprehensible deep dive on the concepts and taxonomies around the concepts of "types of bias", "types of discrimination", and "types of fairness", together with how these interact with the different types of machine learning techniques.

Google's OSS differential privacy

The current implications of data privacy and trust has led into reviving interest into extremely fascinating research areas that have existed for decades. This one in particular is differential privacy, a technique that allows for data to be anonymised in a way that still leaves statistical properties which allow for processing on top of the anonymised data, which can lead to improvements in privacy. Google has released a C++ library of ε-differentially private algorithms, which can be used to produce aggregate statistics over numeric data sets containing private or sensitive information.

OSS: Privacy Preserving ML

The theme for this week's featured ML libraries is Privacy Preserving Machine Learning libraries, and we're happy to share brand new libraries into that section. The four featured libraries this week are:

Intel Homomorphic Encryption Backend - The Intel HE transformer for nGraph is a Homomorphic Encryption (HE) backend to the Intel nGraph Compiler, Intel's graph compiler for Artificial Neural Networks

PySyft - A Python library for secure, private Deep Learning. PySyft decouples private data from model training, using Multi-Party Computation (MPC) within PyTorch

Microsoft SEAL - Microsoft SEAL is an easy-to-use open-source (MIT licensed) homomorphic encryption library developed by the Cryptography Research group at Microsoft
Tensorflow Privacy - A Python library that includes implementations of TensorFlow optimizers for training machine learning models with differential privacy

If you know of any libraries that are not in the "Awesome MLOps" list, please do give us a heads up or feel free to add a pull request!

MLConf = Conferences & Events

We feature conferences that have core ML tracks (primarily in Europe for now) to help our community stay up to date with great events coming up.

Technical & Scientific Conferences

EURNLP 2019 [11/10/2019] - European NLP Research summit in London, UK.

Data Natives [21/11/2019] - Data conference in Berlin, Germany.

ODSC Europe [19/11/2019] - The Open Data Science Conference in London, UK.

Khipu AI [11/11/2019] - Latin American Meeting in Artifical Intelligence in Montevideo, Uruguay.

Business Conferences

Predictive Analytics World [18/11/2019] - Conference for Business AI in Berlin, Germany.

Big Data LDN 2019 [13/11/2019] - Conference for strategy and tech on big data in London, UK.

MLJobs = Jobs & Careers

We showcase Machine Learning Engineering jobs (primarily in London for now) to help our community stay up to date with great opportunities that come up.

Leadership Opportunities

Algorithmia is hiring for a VP of Engineering in Seatle, USA
Fractal Labs is hiring for a VP of Engineering in London

Mid-level Opportunities

Seldon is hiring for a Senior Machine Learning Engineer in London
Proportunity is hiring for a Senior Machine Learning Engineer in London
Atlas ML is hiring for a Lead NLP Engineer in London
StreetBees is hiring for a Senior Data Scientist in London
Tractable is hiring for a Senior Deep Learning Engineer

Junior Opportunities

Migacore is hiring for a Machine Learning Engineer in London
Babylon Health is hiring for a Machine Learning Engineer in London