AI Ethics and Fairness Resources

AI and data ethics and fairness is becoming a very hot topic lately. With computer vision models not being able to see everyone equally to the debacle at Google's AI division, it's something that we all need to look out for when doing any type of work with data.

With that, I'd like to show some resources I found that has been useful when researching this topic. Some are videos that go over how bias can get into data and others are actual research papers that go over how to help mitigate bias.

For a video version of this post, check below:

Videos

There are quite a lot of videos that go over AI ethics. Below are a few of my favorites that have a good amount of information in them.

  • The Trouble with Bias by Kate Crawford - This talk, given at the Neural Neural Information Processing Systems (NIPS) in 2017. Not only does Kate goes over what exactly is bias in machine learning models, but she also goes over the harms that it can cause.

  • Machine Learning and Fairness by Hanna Wallach and Jennifer Wortman Vaughan - This is actually one of my favorite resources on the list. This video goes into several aspects of fairness in machine learning including types of bias that can be in your data as well as ways to help mitigate it such as the Datasheets for Data paper that's linked in the papers section.

  • Transparency and Intelligibility Throughout the Machine Learning Life Cycle by Jennifer Wortman Vaughan - This goes through the entire machine learning life cycle to best incorporate transparency throughout the life cycle.

Courses

There are a couple of courses that go over AI ethics and I believe more will be on the way as time goes on.

  • FastAI Ethics - FastAI's ethics course is probably one of the most comprehensive out there. It has several lectures and each lecture has supplemental materials such as articles and even research papers.

Books

Just like courses are coming to teach people about AI ethics, books are also coming to do the same and also to help how you can prevent bias from creeping into your models.

  • Interpretable AI by Ajay Thampi - One of the first books I've seen on this subject, this book helps you understand why the need for having models that are interpretable and shows how to do it.

Papers and Documents

A lot of the information in the other categories come from earlier research done on data bias and AI ethics. As a result of the research some documents have also come out of it to help people creating models to mitigate the amount of bias in their data.

  • Manipulating and Measuring Model Interpretability - This paper goes into how to measure model interpretability. It also helps answer the question about what is interpretability in terms of a machine learning model.

  • Datasheets for Datasets - In electronics, there is a datasheet accompanied by each component that describes its characteristics, any testing done on it, etc. This paper proposes the idea of having the same for machine learning data.

  • AI Fairness Checklist - This document has a checklist that one can follow throughout the lifecycle of creating a model to lookout for fairness.

Tools

Thankfully, there are some tools out there that can help us interpret how models are making their predictions as well as assessing fairness within the models.

  • Microsoft Fairlearn - This Python tool helps access the fairness in your data. There is a demo available for this that helps show how it works.

  • Microsoft InterpretML - Another Python tool to help interpret machine learning models. This one also has a demo available.

Hopefully, this list gave you a good idea about data and AI ethics and fairness. There are definitely many more resources out there and I have been partial to Microsoft for their research and resources.

There will be more posts on ethics and fairness in the future, as well. Especially covering the two tools from Microsoft, Fairlearn and InterpretML.

Top 5 Artificial Intelligence Sessions at Build 2018

During the same week PyCon was going on, Microsoft had their annual Build conference. If you're not familiar with this conference this is where Microsoft announces a lot of new things for developers. The main focus of this year's Build was about artificial intelligence.

Also in the same way as PyCon, Microsoft records all of the sessions at Build so we're free to watch them later. With that, I present to you what I think are the top five session at Build that go over artificial intelligence. Why top five instead of top ten like in the PyCon session post? Well, there just wasn't enough to do a top ten. :)

10 Things Developers Need to Know about Building Intelligent Apps by Noelle LaCharite

This session goes a lot into the Cognitive Services. Not only how much it can improve your applications by incorporating them, but also how easy they are to implement.

The Microsoft AI platform: a State of the Union by Joseph Sirosh

Continuing with more information about Cognitive Services, this session goes into some of the other capabilities that were announced at Build. A nice demo they showed went over what they did with the released JFK files and how they used Cognitive Search to analyze all of that data.

DevOps for AI : Deploying everywhere by Paige Bailey

This is an interesting talk that goes into the basics of data science and machine learning, but also goes into how to integrate the mindset of DevOps into data science. Doing so can help with things like testing and version control as a process to doing data science which can help reproducability and putting models into production.

Microsoft AI overview for developers by Harry Shum

This video is full of great demos that show off the power of several Cognitive Services. Although, my favorite demo is the one where they show the power of having a bot on your site.

Demystifying Machine and Deep Learning for Developers by Seth Juarez and Chris Lauren

There are quite a few deep learning videos out there, but I firmly believe that this is one of the best. The way that Seth and Chris describe the deep learning process of the neural network algorithms makes it very understandable to what's going on behind the scenes when you train a model.


So a lot of interesting things happened at build in terms of AI, mostly involving their Cognitive Services APIs. These APIs, I think, are going to really help make your apps stand out from the rest of the crowd and Microsoft looks to continue to add to them with APIs from their Cognitive Services Labs.

I'm definitely looking forward to what else they come up with.