Movie IMDb Rating Predictor

EECS 349, Machine Learning

Zhilin Chen

Why plain analysis?

     Before using any machine learning techniques to predict the IMDb ratings of some movies, the raw data we've collected has already provided us many important information. For example, we could figure out the most successful/welcome writers/directors/actors/genres in specific years. These information could also provide us an insight of how the taste of audiences changes among years

Top K Genres/Directors/Writers/Actors between 2006 and 2010

     These ratings depend on the average IMDb ratings of the related movies. It is simple but useful enough to provide us a basic knowledge of how these attributes affect the IMDb ratings. Following tables are the sample results we get from conducting plain analysis on raw data:

    1)

Annual top 3 Genres between 2006 and 2010
     20062007200820092010
1st Documentary(7.22)Biography(7.39)War(7.25)Short(7.52)Documentary(7.48)
2nd War(7.08)Documentary(7.30)Biography(7.11)War(7.15)History(7.40)
3rd Short(7.07)Music(7.25)History(6.8)Biography(7.02)Biography(7.23)

    2)

Annual top 3 Directors between 2006 and 2010
     20062007200820092010
1st C. Nolan(8.5)M. Loades(9.0)C. Nolan(9.0)A. Ruiz(8.7)C. Nolan(8.8)
2nd J. Tusty(8.5)A. Khan(8.5)N. Pandey(8.4)L. Psihoyos(8.5)A. Kapadia(8.6)
3rd M. Scorsese(8.5)J. Laurence(8.4)A. Stanton(8.4)R. Hirani(8.4)V. Motwane(8.3)

    3)

Annual top 3 Writers between 2006 and 2010
     20062007200820092010
1st M. Majoros(8.5)S. Kass(9.0)J. Nolan(9.0)A. Ruiz(8.7)C. Nolan(8.8)
2nd J. Nolan(8.5)A. Gupte(8.5)C. Nolan(9.0)M. Monroe(8.5)M. Pandey(8.6)
3rd A. Mak(8.5)S. Penn(8.2)N. Pandey(8.4)V. Chopra(8.4)L. Unkrich(8.3)

    4)

Annual top 3 Actors between 2006 and 2010
     20062007200820092010
1st H. Jackman(7.32)J. Brolin(7.69)A. Jolie(7.36)E. Bana(7.42)M. Wahlberg(6.96)
2nd B. Willis(7.16)M. Harden(7.23)J. Black(7.00)C. Plummer(7.40)J. Baruchel(6.89)
3rd K. Winslet(7.06)S. Labeouf(6.93)D. Craig(6.89)L. Endre(7.23)R. Hall(6.86)

    5)

Top 5 Genres/Director/Writers/Actors between 2006 and 2010
     GenresDirectorsWritersActors
1st Biography(7.11)C. Nolan(8.77)C. Nolan(8.83)M. Caine(8.15)
2nd War(7.07)M. Scorsese(7.93)J. Lindqvist(7.6)A. Khan(7.96)
3rd Documentary(7.05)D. Fincher(7.73)A. Jensen(7.58)C. Bale(7.8)
4th History(6.93)S. Bier(7.56)D. Cohen(7.4)L. DiCaprio(7.78)
5th Short(6.71)D. Yates(7.56)M. Groening(7.39)M. Ruffalo(7.57)

How average IMDb ratings goes among years?

     We can notice from the image below that annual average IMDb ratings would fluctuate around 6.3. 1992 could be considered as the 'golden year' of movies. What's more, we are witnessing a downward trend of IMDb ratings, which could indicates that the audiences are more fastidious about movies or that the current movies are worse than decades before