Link Search Menu Expand Document

Speaking her Language: Using NLU to build conversational products for girls in developing countries

Partner:Alexander Fulcher, Girl Effect, Non-Profit

Overview

Project Description

Girl Effect builds products for girls in developing markets. We work in Rwanda, India, Ethiopia, South Africa and Tanzania. Our product portfolio contains magazines, television shows, music videos, websites, chatbots and social media channels, all designed to drive uptake of knowledge and change in attitude around key subject areas defined in girl effect’s theory of change.

We try to make products which put our users at the front and centre of the experience. In our digital products, this means allowing girls to share their feedback and guide their own user journeys, rather than be passive recipients of content. As a result of this, we have collected a lot of unstructured qualitative data from our users. In Big Sis, our chatbot product in South Africa, we are now using this data to enable girls to write their sex and relationships questions which we run gainst a dictionary we have created based on existing training data and provide relevant content suggestions.

The challenge that we face is that the submissions we get are in multiple languages, use lots of slang, and often contain misspellings. Building up dictionaries of intent has to reflect the actual language that these girls are using to describe specific subject matter areas. It also means that with each new language we have to begin the same process again. There are very few existing models that are able to cater for this type of engagement.

There are many different use cases we would like to explore to build on the conversational experiences which we are creating: continuing with our subject specific classification to a greater level of nuance and accuracy, but also looking at signs of change expressed by users as a result of their engagement, and also to be able to provide for natural moments of ‘smalltalk’ conversations which help users feel confortable and guide them through their experience.

In order to do this in a more scalable way, we need assistance from data scientists with an expertise in Natural Language Processing to develop appraches to identifying common patterns in the user submissions we have across the different languages and use cases we work with, to establish the extent to which it would be possible to build responses based on a common underlying set of interactions.

Expected Deliverable

We would like to see a vaiety of models and approaches to the challenge using the available data, leading to recommendations as to next steps for implementation within our products.

What would a successful semester look like to you?

To have a clear sense of the feasibility of solving the challenge based on our user data and a recommendation as to the best approach.

Additional Skills from ideal candidates

Understanding of cross language NLU problems.

Data

Models

Conclusion