NATURE – Using Data Science to Understand the Film Industry’s Gender Gap

This study combines data from the online movie database IMDb with a dataset of movie dialogue subtitles to create the largest available corpus of movie social networks (15,540 networks), and uses it to study gender bias in the representation of female characters on screen over the past century.

A few key figures

3.57%

The percentage of on-screen 3-person group interactions that take place between three women, according to an analysis of 15,540 movie networks and the IMDb online film database. This figure includes all genres.

40.74%

The percentage of on-screen 3-person group interactions that take place between three men, according to an analysis of 15,540 movie networks and the IMDb online film database. This figure includes all genres.

34%

The percentage of on-screen roles played by women.

6

According to the database studied, for every 6 interactions between women, there are 10 interactions between men. This means there are 60% as many interactions between women as there are between men.
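
To make the group-interaction figures more concrete, here is a minimal sketch of how the share of three-person groups by gender composition could be computed on a character network. It is not the study's own code. The character names, the toy network, and the `triad_shares` helper are all hypothetical, and the sketch assumes that a "3-person group interaction" corresponds to three characters who each interact with the other two, which this summary does not specify.

```python
from itertools import combinations

# Hypothetical toy data: each character's gender and the pairs who interact.
gender = {
    "Ana": "F", "Bea": "F", "Cleo": "F",
    "Dan": "M", "Eli": "M", "Finn": "M", "Gus": "M",
}
edges = [
    ("Ana", "Bea"), ("Bea", "Cleo"), ("Ana", "Cleo"),
    ("Dan", "Eli"), ("Eli", "Finn"), ("Dan", "Finn"),
    ("Finn", "Gus"), ("Dan", "Gus"),
    ("Ana", "Dan"), ("Bea", "Dan"),
]


def triad_shares(gender, edges):
    """Return the share of fully connected trios, keyed by number of women."""
    # Store each link as an unordered pair so direction does not matter.
    links = {frozenset(edge) for edge in edges}
    counts = {}
    for trio in combinations(sorted(gender), 3):
        # Assumption: a group interaction means all three pairs are linked.
        if all(frozenset(pair) in links for pair in combinations(trio, 2)):
            women = sum(gender[name] == "F" for name in trio)
            counts[women] = counts.get(women, 0) + 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {women: n / total for women, n in counts.items()}


shares = triad_shares(gender, edges)
print("All-female share:", shares.get(3, 0.0))
print("All-male share:", shares.get(0, 0.0))
```

In this toy network, one of the four fully connected trios is all-female and two are all-male. The figures reported above (3.57% and 40.74%) would come from applying a count of this kind across all 15,540 movie networks.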

To go further