Movie IMDb Rating Predictor
EECS 349, Machine Learning
Zhilin Chen
Before using any machine learning techniques to predict the IMDb ratings of some movies, the raw data we've collected has already provided us many important information. For example, we could figure out the most successful/welcome writers/directors/actors/genres in specific years. These information could also provide us an insight of how the taste of audiences changes among years
These ratings depend on the average IMDb ratings of the related movies. It is simple but useful enough to provide us a basic knowledge of how these attributes affect the IMDb ratings. Following tables are the sample results we get from conducting plain analysis on raw data:
1)
2006 | 2007 | 2008 | 2009 | 2010 | |
---|---|---|---|---|---|
1st | Documentary(7.22) | Biography(7.39) | War(7.25) | Short(7.52) | Documentary(7.48) |
2nd | War(7.08) | Documentary(7.30) | Biography(7.11) | War(7.15) | History(7.40) |
3rd | Short(7.07) | Music(7.25) | History(6.8) | Biography(7.02) | Biography(7.23) |
2)
2006 | 2007 | 2008 | 2009 | 2010 | |
---|---|---|---|---|---|
1st | C. Nolan(8.5) | M. Loades(9.0) | C. Nolan(9.0) | A. Ruiz(8.7) | C. Nolan(8.8) |
2nd | J. Tusty(8.5) | A. Khan(8.5) | N. Pandey(8.4) | L. Psihoyos(8.5) | A. Kapadia(8.6) |
3rd | M. Scorsese(8.5) | J. Laurence(8.4) | A. Stanton(8.4) | R. Hirani(8.4) | V. Motwane(8.3) |
3)
2006 | 2007 | 2008 | 2009 | 2010 | |
---|---|---|---|---|---|
1st | M. Majoros(8.5) | S. Kass(9.0) | J. Nolan(9.0) | A. Ruiz(8.7) | C. Nolan(8.8) |
2nd | J. Nolan(8.5) | A. Gupte(8.5) | C. Nolan(9.0) | M. Monroe(8.5) | M. Pandey(8.6) |
3rd | A. Mak(8.5) | S. Penn(8.2) | N. Pandey(8.4) | V. Chopra(8.4) | L. Unkrich(8.3) |
4)
2006 | 2007 | 2008 | 2009 | 2010 | |
---|---|---|---|---|---|
1st | H. Jackman(7.32) | J. Brolin(7.69) | A. Jolie(7.36) | E. Bana(7.42) | M. Wahlberg(6.96) |
2nd | B. Willis(7.16) | M. Harden(7.23) | J. Black(7.00) | C. Plummer(7.40) | J. Baruchel(6.89) |
3rd | K. Winslet(7.06) | S. Labeouf(6.93) | D. Craig(6.89) | L. Endre(7.23) | R. Hall(6.86) |
5)
Genres | Directors | Writers | Actors | |
---|---|---|---|---|
1st | Biography(7.11) | C. Nolan(8.77) | C. Nolan(8.83) | M. Caine(8.15) |
2nd | War(7.07) | M. Scorsese(7.93) | J. Lindqvist(7.6) | A. Khan(7.96) |
3rd | Documentary(7.05) | D. Fincher(7.73) | A. Jensen(7.58) | C. Bale(7.8) |
4th | History(6.93) | S. Bier(7.56) | D. Cohen(7.4) | L. DiCaprio(7.78) |
5th | Short(6.71) | D. Yates(7.56) | M. Groening(7.39) | M. Ruffalo(7.57) |
We can notice from the image below that annual average IMDb ratings would fluctuate around 6.3. 1992 could be considered as the 'golden year' of movies. What's more, we are witnessing a downward trend of IMDb ratings, which could indicates that the audiences are more fastidious about movies or that the current movies are worse than decades before