Abhinay's Web-Dev Blog: FREE! PHP IMDb Scraper/API for new IMDb Template

Saturday, October 30, 2010

FREE! PHP IMDb Scraper/API for new IMDb Template

IMDb is the leading source of information about movies and TV, and a top target of web scraping for movie lovers around the world. Unfortunately, IMDb does not provide an API to access its database, so web scraping is our only resort. PHP, one of the most commonly used web development languages, makes web scraping easy with the power of PCRE (Perl Compatible Regular Expressions).

For my recent project, a Movie Catalog (http://movies.abhinayrathore.com), I needed an IMDb scraper and found one built by Tyler Hall. His version was not robust enough to scrape every kind of movie page, so I extended it to support different types of titles. Recently, however, IMDb changed its page template and most of the old scrapers stopped working, including mine. So I modified my scraper to accommodate the new template and considered it my moral responsibility to contribute it back to the developer community.

This new scraper is robust enough to handle a wide variety of template variations. Apart from the regular information, it also scans extra media images and release dates.

Click here for a Demo

Last Updated: Feb 1, 2014

Major changes in the Feb 20, 2013 version:

- The scraper now uses the combined information page to scrape the data. This page doesn't change very often, and it gives the complete list of people in each department.
- Added a few more entities: producers, musicians, cinematographers, editors etc. Removed metascore information. Removed the small poster URL.
- You can now pass a second boolean parameter to the getMovieInfo() and getMovieInfoById() functions to disable the extra information. It defaults to true, which may slow down the scraping. If you don't need the extra info (Storyline, Release Dates, Recommendations or Media Images), just pass false as the second parameter to these methods. Example: $movieArray = $imdb->getMovieInfo("The Godfather", false);
- Information for individuals in the lists of directors, cast, writers etc. is now an associative array keyed by the IMDb id of the individual.
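
Because people are now keyed by IMDb id, looping over them looks roughly like the sketch below. The sample array stands in for what getMovieInfo() returns, and formatPeople() is a made-up helper, not part of the scraper. The exact key names and value shapes are assumptions, so check them against the array your copy of the class actually returns.

```php
<?php
// Hypothetical sample shaped like the post describes: IMDb id => name.
// Real data would come from $imdb->getMovieInfo("The Godfather", false);
$movieArray = array(
    'title' => 'The Godfather',
    'cast'  => array(
        'nm0000008' => 'Marlon Brando',
        'nm0000199' => 'Al Pacino',
    ),
);

// Illustrative helper (not part of the scraper): turn an id => name
// array into "Name (id)" strings.
function formatPeople(array $people)
{
    $lines = array();
    foreach ($people as $imdbId => $name) {
        $lines[] = $name . ' (' . $imdbId . ')';
    }
    return $lines;
}

echo implode("\n", formatPeople($movieArray['cast'])), "\n";
```

Keeping the id as the key makes it easy to link each name back to its IMDb page later.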

Here is a list of all the attributes it scrapes from the IMDb page:

TITLE_ID
TITLE
YEAR
RATING
GENRES
STARS
DIRECTORS
WRITERS
CAST
PRODUCERS
MUSICIANS
CINEMATOGRAPHERS
EDITORS
ALSO_KNOWN_AS
RELEASE_DATE
RELEASE_DATES
PLOT
POSTER
POSTER_LARGE
RUNTIME
TOP_250
OSCARS
AWARDS
NOMINATIONS
STORYLINE
TAGLINE
MEDIA_IMAGES
MPAA_RATING
VOTES
RECOMMENDED_TITLES
VIDEOS

How to use this PHP Scraper?
Include the class file in your PHP page:
include("imdb.php");
Instantiate the class and get the results in an array:
$imdb = new Imdb();
$movieArray = $imdb->getMovieInfo("The Godfather");

You can try this scraper on my lab page: http://lab.abhinayrathore.com/imdb/

To download the PHP Source Code directly use this link: http://lab.abhinayrathore.com/imdb/imdb_php.htm

Fork it on GitHub: https://github.com/abhinayrathore/PHP-IMDb-Scraper

Example usage: http://lab.abhinayrathore.com/imdb/usage.htm

Proxy script for downloading or displaying Media images on your website: http://lab.abhinayrathore.com/imdb/imdbImage.txt

To implement your own IMDb Web Service API to return data in XML, JSON or JSONP format, use this script along with the API: http://lab.abhinayrathore.com/imdb/imdbWebService.htm

To implement IMDb.com's search suggestions on your website, please follow this post: http://web3o.blogspot.com/2011/10/imdb-search-suggestions-with-jquery.html

If you find any part of this scraper broken or incorrect, please drop a comment here and I’ll try to fix it as soon as possible.

IMDb has an anti-leeching policy in place for media images, so you may not be able to use the URLs of some images directly on your website. As a workaround you can use a PHP proxy to display or download those images. I’ve written a small proxy script to grab the images: http://lab.abhinayrathore.com/imdb/imdbImage.txt. To use this script you just need to pass the image URL as a request parameter:
<img src="imdbImage.php?url=<?=$url?>" />
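
One caveat with the snippet above: image URLs contain characters such as ":", "/" and sometimes "&", so it is safer to URL-encode the value before putting it in the query string. A minimal sketch, where proxyImageTag() is a hypothetical helper and imdbImage.php is assumed to be the proxy script saved under that name:

```php
<?php
// Hypothetical helper: build an <img> tag that routes an image
// through the proxy script, encoding the URL for the query string
// and escaping the attribute value for HTML output.
function proxyImageTag($imageUrl, $proxy = 'imdbImage.php')
{
    $src = $proxy . '?url=' . urlencode($imageUrl);
    return '<img src="' . htmlspecialchars($src, ENT_QUOTES) . '" />';
}

// The URL below is a placeholder, not a real IMDb image.
echo proxyImageTag('http://example.com/images/poster.jpg?size=1&x=2'), "\n";
```

The proxy script receives the decoded value in $_GET['url'] as usual, since PHP decodes query parameters automatically.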

NOTE: For users outside of the USA
IMDb will automatically redirect you to titles listed in the language used for release in your country.
To see films listed under their original titles regardless of your country, you will have to modify this script to scrape the titles from http://akas.imdb.com, because http://www.imdb.com will automatically redirect you to your country-specific title page.
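
Since the title id is the same on every country site, one way to approach this is to extract the id from whatever URL you ended up on and rebuild the URL on the host you want. Both helpers below are hypothetical names used for illustration, and where you plug them into the class depends on your copy of it. Whether akas.imdb.com gives the titles you expect is not guaranteed either.

```php
<?php
// Hypothetical helper: pull the title id out of any IMDb title URL,
// including country-specific ones.
function extractTitleId($url)
{
    return preg_match('#/title/(tt[0-9]+)#', $url, $m) ? $m[1] : false;
}

// Hypothetical helper: build a title URL on a chosen IMDb host.
function buildTitleUrl($titleId, $host = 'akas.imdb.com')
{
    if (!is_string($titleId) || !preg_match('/^tt[0-9]+$/', $titleId)) {
        return false; // not a well-formed title id
    }
    return 'http://' . $host . '/title/' . $titleId . '/';
}

$id = extractTitleId('http://www.imdb.es/title/tt0926084/');
echo buildTitleUrl($id), "\n"; // http://akas.imdb.com/title/tt0926084/
```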

Happy Scraping :)


319 comments:

Anonymous, November 2, 2010 at 2:51 PM
Thanks a lot for this script!
I have zero knowledge of php but got this script running on windows using xampp.

If you use xampp you might see this error:
Call to undefined function: curl_init()

Open php.ini file and uncomment this line:
extension=php_curl.dll

Then restart the server and its fixed!
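
A quick way to confirm the extension is really loaded after the restart is to check for it before calling the scraper. This is only a sanity-check sketch, and curlIsAvailable() is a made-up helper name:

```php
<?php
// Illustrative helper: true when PHP's cURL extension is loaded.
function curlIsAvailable()
{
    return extension_loaded('curl') && function_exists('curl_init');
}

if (curlIsAvailable()) {
    echo "cURL is enabled\n";
} else {
    echo "cURL is missing: enable the curl extension in php.ini\n";
}
```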

Anonymous, November 2, 2010 at 5:37 PM
Could you add the MPAA rating?

Anonymous, November 6, 2010 at 9:53 AM
I am pretty new to PHP. I got xampp installed and working now. let me know, how to test this scraper ?

Asmaka.

Abhinay Rathore, November 15, 2010 at 9:51 AM
MPAA Rating included: http://lab.abhinayrathore.com/imdb/imdb.txt

Anonymous, November 16, 2010 at 7:44 PM
Is it possible to get the full plot, instead of a cut off one?

Anonymous, November 19, 2010 at 3:11 PM
My server is not located in the USA, can you get it to scrape the USA title, instead of the one from the country it is in?

Abhinay Rathore, November 19, 2010 at 3:24 PM
It is independent of what country you are in. The Internet is the same everywhere, so it can scrape international movie pages as well.

serhatyolacan, November 20, 2010 at 8:09 AM
$arr['votes'] = $this->match('/>(([0-9]+),([0-9]+)).*?votes<\/a>/ms', $html, 1);

Is this true?

Abhinay Rathore, November 20, 2010 at 8:40 AM
serhatyolacan,
Try this:
$arr['votes'] = $this->match('/href="ratings".*?>([0-9]+,?[0-9]*) votes<\/a>\)/ms', $html, 1);
This will match even if there are 10 votes or 500,000 votes.

I've also added votes to the scraping list. Get the latest imdb.txt file from the link above
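
To see what a pattern like this captures, it can be run against a small HTML fragment with preg_match(). The fragments below are invented to resemble the markup the pattern expects, and parseVotes() is a hypothetical helper, not a method of the scraper. Note that [0-9]+,?[0-9]* allows at most one comma, so a count such as 1,234,567 would not match; the sketch uses [0-9,]+ to cover that case, which is a variation and not the pattern from the comment.

```php
<?php
// Hypothetical helper: extract the vote count as an integer.
// Uses [0-9,]+ (a variation on the pattern above) so that counts
// with more than one thousands separator also match.
function parseVotes($html)
{
    $pattern = '/href="ratings".*?>([0-9,]+) votes<\/a>\)/ms';
    if (preg_match($pattern, $html, $m) !== 1) {
        return false;
    }
    return (int) str_replace(',', '', $m[1]);
}

// Invented fragments resembling the expected markup.
var_dump(parseVotes('(<a href="ratings">10 votes</a>)'));        // int(10)
var_dump(parseVotes('(<a href="ratings">500,000 votes</a>)'));   // int(500000)
var_dump(parseVotes('(<a href="ratings">1,234,567 votes</a>)')); // int(1234567)
```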

serhatyolacan, November 20, 2010 at 8:56 AM
Here is the another question :)

Is it possible to scrap actor pictures?

And cache media + actor images to our servers?

And another question...

Is it possible to call strings maually? For example:

<div class="title">
<?php echo $title ?>
</div>

<div class="actorslist">
<?php echo $cast ?>
</div>

...

serhatyolacan, November 20, 2010 at 9:22 AM
Actually i know lots of wordpress functions. And i'm thinking wordpress plugin with some options. Or theme functions. I don't like imdbphp2 script. So i found your imdb class.

Plugin will work like this: (You can see in top of sidebar)

http://www.odfi.tv/yabanci-filmler/mutant-gunlukleri-chronicles-2008-turkce-divx-hd-online-izle/

You will use custom field with "movie name" or "movie id" (prefer to use id).

All informations will be saved to sql. So they will be cached with post. If you want to upgrade imdb infos. You just only need to update this post.

This is great idea but i have crap php knowledge. So if possible help please :)

Anonymous, November 20, 2010 at 9:34 AM
It is not independent of what country you are in, because for the new harry potter movie, i get "Haris Poteris ir mirties relikvijos - 1 dalis" as the title.

Abhinay Rathore, November 20, 2010 at 11:40 AM
Anonymous, I am using Google to search for the titles on IMDb as it is more accurate than IMDb search, and I believe Google is automatically detecting your country locale and redirecting your request to the locale-specific IMDb page.

For example: The Spanish site for the Harry Potter movie is http://www.imdb.es/title/tt0926084/

As a workaround, you might have to modify this parser a little bit. In the first run, let it give you the movie info in your locale. Then take the movie id (which is the same for all locales) and build a new URL like http://www.imdb.com/title/tt0926084/ (note imdb.com in place of imdb.es), then use this new .com URL directly to parse the movie info in English.

Hope this helps :)

Abhinay Rathore, November 20, 2010 at 11:48 AM
serhatyolacan, your idea of including the actors' images is pretty good, but we cannot include them in the media images because it would be difficult to distinguish actors' images from the other ones. What I am planning to do is return an associative array with each actor's name and his/her image. For that I'll have to add some extra functions, which I plan to do in the next version.

As for the wordpress plugin, I've never worked on those... but you've given a new idea to explore :)

Anonymous, November 20, 2010 at 1:19 PM
It is not being converted into a different domain, but the title is being translated.

Here is an example:
Red Eye on IMDb is: http://www.imdb.com/title/tt0421239/.

If scrape this url, I get a a title of "Naktinis reisas"

If i visit the url from my server, I see that a new line has been added "Original title: Red Eye"

Here is a paste of the source when viewed on my server(look at line 419 - 432): http://pastebin.com/Xr87t7ny

Abhinay Rathore, November 20, 2010 at 6:07 PM
serhatyolacan: you can print individual array elements:
<div class="title">
<?php echo $movieArray['title'] ?>
</div>

Abhinay Rathore, November 20, 2010 at 6:12 PM
Anonymous, I guess you'll have to add another field in the parser to parse Original Title.
Let me know if you want me to send you the regular expression for that.

Anonymous, November 23, 2010 at 8:30 AM
I have a question about the regular expressions. I am trying to write this code in c# and I couldn't understand the lines like

$arr['title_id'] = $this->match('/id="(tt[0-9]+)\|imdb/ms', $html, 1);

what does the "ms" do in the end of the regex. And also can you tell me what match function do exactly. As I understand it searces for the expression in the html string but I couldn't understand what it returns.

Thanks..

Abhinay Rathore, November 23, 2010 at 8:41 AM
Anonymous,
I am already working on a C#/ASP.net based IMDb Scraper and it'll be ready in coming weeks... So if you can hold that long, I'll post the new library on this blog :)

To learn more about PHP regular expressions you can search on Google and you'll get all kind of tutorials.
As for the "ms" in the pattern, read more about PCRE pattern modifiers: http://www.php.net/manual/en/reference.pcre.pattern.modifiers.php
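
In short, "m" makes ^ and $ match at line boundaries and "s" lets the dot match newlines, which matters when the HTML being scraped spans several lines. As for match, a helper of that kind typically runs preg_match() and returns the requested capture group. The sketch below is an assumption about how such a helper behaves, written under the name matchGroup(); it is not copied from imdb.php.

```php
<?php
// Assumed shape of the helper: return capture group $i of the first
// match, or false when the pattern does not match.
function matchGroup($regex, $str, $i = 0)
{
    return preg_match($regex, $str, $m) === 1 ? $m[$i] : false;
}

$html = "<h1>\nThe Godfather\n</h1>";

// Without "s" the dot stops at newlines, so nothing matches.
var_dump(matchGroup('/<h1>(.*?)<\/h1>/', $html, 1));        // bool(false)

// With "s" the dot also matches newlines.
var_dump(trim(matchGroup('/<h1>(.*?)<\/h1>/s', $html, 1))); // string(13) "The Godfather"

// With "m", ^ and $ match at the start and end of each line.
var_dump(matchGroup('/^The (\w+)$/m', $html, 1));           // string(9) "Godfather"
```

For the C# port, the rough equivalents are RegexOptions.Multiline and RegexOptions.Singleline.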

Anonymous, November 23, 2010 at 1:30 PM
I get the general idea about scrapping but I am stuck in the "genre" part. I am putting the regex '/Genre.?:(.*?)(<\/div>|See more)/ms' into the regular expression software and it found nothing. Did the template of imdb changed or what??

Anonymous, November 24, 2010 at 12:22 AM
Hello.
Can you provide your index.php usage? I have zero experience with php.
Thanks.

n0m3rcy, November 26, 2010 at 1:21 PM
To get the original title I added the following line:

$arr['orig_title'] = trim($this->match('/<span class="title-extra">\\n(.*?) \\n<i>/ms', $html, 1));

Anonymous, November 29, 2010 at 3:40 AM
This is incredible, I was going to make a one, probably a crappy one, but no, you did, you are the man. Thanks!

selidori, December 1, 2010 at 10:42 AM
On my server say:
"Fatal error: Call to undefined function: str_ireplace() in xxxxxxxx/imdb.php on line 15)"
on imdb.php file.

Line invoked is:
$title = str_ireplace('the ', '', $title);

(tested with provided usage.php).

Any ideas?

Abhinay Rathore, December 1, 2010 at 10:50 AM
You are using an older version of PHP.
str_ireplace was introduced in PHP 5: http://php.net/manual/en/function.str-ireplace.php

You can even remove/comment out this line and the scraper should work fine without it :)
BUT, it's better if you upgrade to the latest PHP version.

selidori, December 2, 2010 at 3:26 AM
Speedy, serious, precisely.
I love this man!

I can't upgrade webserver provided, meanwhile my phpinfo() reply with:
PHP Version 5.2.14

If I comment str_ireplace, another error accorred:

Fatal error: Call to undefined function: stripos() in xxxxx/imdb.php on line 18

Of course line 18 is:
if(stripos($html, "302 Moved") !== false)

.... you'll expect a gift for Christmas!

Victor, December 5, 2010 at 3:33 PM
how can i use this to be stored in a database in mysql?

Anonymous, December 13, 2010 at 7:49 PM
How can i add the search box? thanks

Abhinay Rathore, December 15, 2010 at 9:27 AM
Victor, you can search on google for storing data to MySql using PHP. It is out of scope for this project.

Abhinay Rathore, December 15, 2010 at 9:30 AM
Anonymous, you can look at the html code of the test page (http://lab.abhinayrathore.com/imdb/) on how to add the search box.

Anonymous, December 18, 2010 at 2:11 PM
Here is the code to get the actors out
http://pastebin.com/pJtY064h

Carlos Gomes, December 21, 2010 at 11:27 PM
This is awesome, Abhinay!
I have one issue, though. The $arr['genres'] seems too complicated for me.
I wanted to insert the genres into a SQL DB, but i cant manage to do it, because the only result $movieArray[genres] is giving me is the word 'Array'.
I was planing to do at the end 'INSERT INTO table_name [...] VALUES $movieArrays[genres] ]...]
Is there anything I can do about it?
Thanks in advance

Abhinay Rathore, December 22, 2010 at 8:38 AM
Carlos,
You can convert a PHP Array into a comma separated string using implode function:
$value = is_array($value)?implode(",",$value):$value;
(First check whether it is an array; if it is, convert it into a comma-separated string.)
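
Applied to the whole result, the same idea flattens every array field before the values go into a query. The sample array below is made up and flattenForDb() is a hypothetical helper. For the INSERT itself, prepared statements (PDO or mysqli) are the safer way to pass the values.

```php
<?php
// Hypothetical helper: turn array fields into comma-separated
// strings and leave scalar fields untouched.
function flattenForDb(array $movie)
{
    $flat = array();
    foreach ($movie as $key => $value) {
        $flat[$key] = is_array($value) ? implode(', ', $value) : $value;
    }
    return $flat;
}

// Made-up sample shaped like the scraper's output.
$movieArray = array(
    'title'  => 'The Godfather',
    'genres' => array('Crime', 'Drama'),
);

print_r(flattenForDb($movieArray));
```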

Unknown, December 22, 2010 at 2:16 PM
Great script! How can use it to scrape imdbTV?

Peter Morris, December 23, 2010 at 3:14 PM
Hi

For some reason sometimes the movie poster doesn't appear. There's no error and the path is correct but it just stays blank. IMDB anti-leech maybe?
Anyway I solved it by storing the image in my disc and only then showing it.

Any ideas on how to retrieve the new "Stars" field on IMDB? The cast shows unknown actors most of the time.

Cheers,
Pipanni

Abhinay Rathore, December 25, 2010 at 12:26 AM
Hey Pedro,
Yes IMDb does have an anti-leech policy.
Also, I've added the code to scrape the "Stars" field from the IMDb page.

Carly Fiorina, December 31, 2010 at 2:23 AM
Hi,

Really it is a nice blog, I would like to tell you that you have given me much knowledge about it. Thanks for everything.

Extract Web

Wiethoofd, January 7, 2011 at 7:04 AM
How is it that when I search for 'House of Flying Daggers' my script returns 'Shi mian mai fu' and your own hosted scraper returns 'House of Flying Daggers'.

Abhinay Rathore, January 7, 2011 at 8:52 AM
Wiethoofd,
What country are you located in?
It's because IMDb is redirecting you to your country specific locale page. Example Italian Page: http://www.imdb.it/title/tt0385004/. You can try replacing the locale code with "com" in these url's and try if you can get to the English Page.

Wiethoofd, January 7, 2011 at 10:22 AM
I'm located in the Netherlands.

Even when I search directly for the IMDb-ID or request the .com/title/ page it returns the Chinese title.

I managed to preg_match the 'Also Known As:' title but in most cases the English/International title is required.

When using the 'IMDb API' it returns the correct title: http://imdbapi.com/?i=tt0385004

Abhinay Rathore, January 7, 2011 at 2:44 PM
IMPORTANT: For all users outside of the USA...
You might see titles listed in the language used for release in your country.

To see films listed under their original titles regardless of your country, you will have to modify this script to scrape the titles from http://akas.imdb.com, because "http://www.imdb.com" will automatically redirect you to your country-specific title.

Additionally, I have modified the script to scan all the AKA Titles as well and try to extract USA Title from that list. The USA_Title may not be the correct one all the time, so you can modify the script to extract the exact titles according to your needs.

Please go ahead and test the new version of this script to see if it works for you.

Wiethoofd, January 7, 2011 at 3:35 PM
Thanks for the attempted fix, but using the http://akas.imdb.com instead of the http://www.imdb.com doesn't work either. Not in the scraper nor my browsers. The USA Title on the other hand does work.

Howcome the scraper isn't using the http://akas.imdb.com/title/tt0385004/combined page (note the 'combined' part) to scrape all the info, this should contain much more information than the regular movie description page.

Changing the Google I'm Feeling lucky search link to search for 'site:imdb.com' instead of 'imdb' should always result in an imdb-page to scrape.

Wiethoofd, January 7, 2011 at 4:29 PM
I noticed the IMDb scraper sometimes scrapes a banner for a poster image when there is no poster available (the matching fails?)

I added the next line after $arr['poster'] to get rid of banner-images, if any.
if(preg_match('/^http:\/\/ad.doubleclick.net\//', $arr['poster'])) $arr['poster'] = "";

Try it yourself with the movie 'Kooky'

kevin, January 11, 2011 at 1:28 PM
hi,
i would want to give in the IMDB link.
lets say $url.
and i would want to store the imdb info to my database, how do i do this?

i haven't been doing php for a while so this is kind of hard for me

scottulster, February 4, 2011 at 9:08 PM
Thanks very much for this, it's great!

I modified/created two functions for those that don't want to use the entire file and only need the imdb url and thumbnail (well that's what I needed).

http://php.pastebin.com/7xWuZui6

Pedro, February 7, 2011 at 11:07 AM
Hi

I've been using your scraper on a project of mine and today I came across a bug that's puzzling me. Any movie directed by Roland Emmerich won't show its director. It stays blank. I would try to correct this myself but my regex skills are basic.
You can replicate this bug on any movie by Roland Emmerich: "Stargate", "Independence Day", "Godzilla", "The Day After Tomorrow". :P

Cheers,
Pipanni (Pedro)

Abhinay Rathore, February 7, 2011 at 11:26 AM
Pedro,
Thanks a bunch for pointing out this bug.
It was a bug in the regex where it was looking for a closing div "</div>" or "and " for filtering out the directors div container.
And because "Roland Emmerich" contains an "and " in between, it was never stripping out the complete container.

I've fixed the bug and it should be working fine for both directors and writers :)
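
The failure mode is easy to reproduce in isolation. The markup and both patterns below are invented for illustration (the scraper's actual regex is not shown in this thread); the point is only that a lazy match ending on a bare "and " stops inside "Roland ".

```php
<?php
// Invented markup resembling a directors block.
$html = '<div>Director: <a href="#">Roland Emmerich</a></div>';

// Buggy terminator: stops at </div> OR at the first "and ",
// which also occurs inside "Roland ".
preg_match('/Director:(.*?)(<\/div>|and )/s', $html, $buggy);

// Stricter terminator (one possible fix): "and " only counts when it
// directly follows a closing ">" with at most one character between.
preg_match('/Director:(.*?)(<\/div>|>.?and )/s', $html, $fixed);

var_dump(trim(strip_tags($buggy[1]))); // string(3) "Rol"
var_dump(trim(strip_tags($fixed[1]))); // string(15) "Roland Emmerich"
```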

Anonymous, February 24, 2011 at 4:32 PM
is this possible with Tv shows as well? i have been looking over the code and don't see that info... any help would be appreciated

Aron, February 26, 2011 at 8:12 PM
I love this - but I can't get it to function right out of the gate. I've tested it and the URL pulls up the correct page, but when used in the function getMovieInfo the variable $html always pops a 302 moved error. Are there any compatibility issues with using godaddy that anyone knows about? here's my test link:
http://www.movielint.com/db/imdb_test.php

Guldstrand, February 28, 2011 at 5:44 AM
How to get the "Filming Locations" and "Company" (without the links/a-tags)?

Guldstrand, February 28, 2011 at 6:00 AM
"is this possible with Tv shows as well? i have been looking over the code and don't see that info... any help would be appreciated"

Yes, it would be great to customize this to only focus on TV shows and TV movies.

Pedro, March 5, 2011 at 6:46 AM
Hello Abhinay

Is there any sure way to tell the difference between a movie and a tv show episode?
I could check the running time and stop a movie insertion in the DB if that value is under 60 minutes but that would be a bit lame.

As soon as my movies site is done I'll let you know, as you're definitely in my "Thanks" list. :)

Anonymous, March 10, 2011 at 8:51 AM
It appears Google is now blocking scripting attempts to use their service, probably thanks to Bing and the like.

Anonymous, March 11, 2011 at 3:47 AM
It doesnt work anymore :s
Can we have any news from the creator ?
It was an awesome api :/

Abhinay Rathore, March 11, 2011 at 7:00 AM
Anonymous,
The scraper seems to be working on my side (USA)... what country are u located in? It might be some local problem.

Pedro AJ, March 11, 2011 at 11:54 AM
The scraper is also fine here in Portugal.
Abhinay, anything about my previous question? (how to distinguish a tv show from a movie)

Anonymous, March 14, 2011 at 9:39 AM
I am in france, and it's not working for now. I made some change to the google search url, and it worked fine until now, so i will try to change it back tomorrow to see what happen.

Anonymous, March 14, 2011 at 10:10 AM
I couldn't wait to finish my work to work on it =)

And with the proper url, it's working !
Might be some trouble with the google.fr search maybe... Don't know :s

Thanks anyway for your API, Very usefull one !

Unknown, March 15, 2011 at 6:48 PM
Big thanks for your library, good work !

Anonymous, March 22, 2011 at 8:32 AM
I've been using this API for a while and it works great, but I am running across an error for a few new movies, and you can test it inside the demo here on the site.

But movies like 'Season of the Witch' and 'The Chronicles of Narnia: Voyage of the Dawn Treader' continually reproduce an error 'title not on IMDB' but they are.

Not sure what the glitch is.

Oliver, March 23, 2011 at 1:26 AM
Seems like imdb changes something because i didn't get results for all movies i searched for. All movies are not on imdb...

Anonymous, March 25, 2011 at 5:04 PM
Well the API is using Google's 'Im Feeling Lucky' search so I just replaced that with the actual IMDB url and it seems to have cleared a few errors.

Unknown, April 4, 2011 at 4:35 PM
Hey. Nice script. I thinking: maybe it is possible to make js what will get info from imdb.php without reloading the page and fill some html code with specific entries from info have got? It would be interesting to make such thing.

Martien, April 6, 2011 at 6:32 PM
For the people wanting to scrape TV shows, here's how to do it:

You'll need:
- the season and episode number
- to add a suffix to the search term before it is URL-encoded

Add the following to your search query: Moviename + " (#" + season + "." + episode + ") (TV Series)"

It should look like this: "House (#1.4) (TV Series)"

Good luck!

Anonymous, April 22, 2011 at 3:57 PM
I've been looking for a way to catalog a collection of films. A simple catalog at that; just a title, year, director and genre.

As opposed to manually building this database, I considered an semi-automated route.

I decided to build, (we'll call it an "application"), where it takes user input (a movie title), feeds that to your scraper then writes the relevant data to a database. Easy.

Everything was going fine at first. However, while working on this application it appears my server's IP may have been banned from accessing IMDb as I continue to receive "No Title found on IMDb!".

I've tested the scraper without any of my modifications and I still continue to receive that error. Have you seen something like this in the past and how do / did you avoid the same fate with your labs demo?

Thanks.

Abhinay Rathore, April 22, 2011 at 4:54 PM
Anonymous,
I guess you are still using the old version of this scraper.
Try the latest version from top link and see if you get the same error.

Anonymous, April 22, 2011 at 6:26 PM
Hi there Abhinay! Thanks for your reply. I am indeed using the latest version of your scraper. Shortly after I posted my first message, it started to work again. It worked for about 30 minutes I'd say and has now just begun throwing the "No Title found on IMDb!" errors.

Perhaps I am trying to make too many requests within a short amount of time so either Google or IMDb's firewall is temporarily blocking my server's IP?

Anonymous, May 4, 2011 at 10:23 PM
HI Abhinay Rathore, thanx for great script.
I`m using your lab page because I don`t have any knowledge about PHP and I can`t use your source php on my server. If possibl, I will be so happy if you help me to use this php script in miy own server.
thanx

StefaanD, May 12, 2011 at 8:24 AM
Excellent script Abhinay,

I wonder if it would be possible to get the full cast list for a movie. That would mean another scrap as the full cast now resides on a separate page, where it used to be all on one page.

The "new" IMDB layout is a totally different discussion ;-)

It always points to /fullcredits#cast

Ciao
Stef,

Anonymous, May 12, 2011 at 12:30 PM
Hi... great blog!

I'm experiencing problems with it...
I first could run your script + test php without any problem...
I didn't modify anything and now I'm getting:
No Title found on IMDb!
any idea?

Thanks!!!

Anonymous, May 13, 2011 at 11:59 AM
Not working the following error:

Notice: Undefined offset: 0 in C:\wamp\www\IMDB\imdb.php on line 34
ERROR No Title found on IMDb!

Anonymous, May 16, 2011 at 6:09 AM
Script works perfectly.. Stop posting stupid comments.. If it doesn't work, its due you! stop crying and learn PHP & HTML.

Anonymous, May 17, 2011 at 12:33 PM
Hello. Is it possible to add Plot keywords?

Anonymous, May 17, 2011 at 12:35 PM
And is it possible to add reward and nomination counts behind oscars? Thanks for great work...

serhatyolacan, May 23, 2011 at 6:09 PM
// Awards
$arr['awards'] = trim($this->match('/([0-9]+) wins/ms',$html, 1));

// Nominations
$arr['nominations'] = trim($this->match('/([0-9]+) nominations/ms',$html, 1));

serhatyolacan, May 23, 2011 at 6:10 PM
If you need posters names for something:

$arr['poster_name'] = strtolower(preg_replace("/^(http:\/\/.*\/)?/i",'',$arr['poster']));
$arr['poster_small_name'] = strtolower(preg_replace("/^(http:\/\/.*\/)?/i",'',$arr['poster_small']));
$arr['poster_large_name'] = strtolower(preg_replace("/^(http:\/\/.*\/)?/i",'',$arr['poster_large']));

serhatyolacan, May 23, 2011 at 7:30 PM
Full size posters and poster names:

$arr['poster_full'] = substr($arr['poster'], 0, strrpos($arr['poster'], "_V1.")) . "_V1._SY0.jpg";
$arr['poster_full_name'] = strtolower(preg_replace("/^(http:\/\/.*\/)?/i",'',$arr['poster_full']));

HaCk CrAcK, May 29, 2011 at 3:21 AM
You can set a condition to find a way if you search according to a series or movie?

HaCk CrAcK, May 29, 2011 at 3:35 PM
how print the url of the movie imdb?

serhatyolacan, May 29, 2011 at 6:02 PM
<?php echo $movieArray['imdb_url'] ?>

HaCk CrAcK, May 29, 2011 at 7:58 PM
thanks master! :D

I can not solutions "I'm feeling lucky" feature. Use the new script but still not working :S

Anonymous, May 29, 2011 at 8:13 PM
Awesome script!, thx for sharing!.

Anonymous, May 30, 2011 at 2:27 AM
If someone add Cinematography and Original Music it will be perfect.

HaCk CrAcK, May 30, 2011 at 6:37 PM
I'm working on a new way to find the movie imdb url to avoid the "I'm feeling lucky " google

HaCk CrAcK, May 30, 2011 at 9:19 PM
You can change the language to Spanish from the results?

serhatyolacan, May 31, 2011 at 7:34 PM
I tried to add Cinematography and Original Music and something else too in full credits. But i can't write their functions. Because they are in "/$title_id/fullcredits" page and they are in same table elements. Its not looking like cast tables (so can't copy foreach loop for cast). It seems to write their functions impossible or hard :/

serhatyolacan, May 31, 2011 at 7:49 PM
Movie language:

$arr['language'] = trim(strip_tags($this->match('/Language.?:<\/h4>(.*?)<\/div>/ms', $html, 1)));

serhatyolacan, May 31, 2011 at 7:53 PM
Now trying to add filming location, production in movie page. And cinematography, original music, producer, make up... in /fullcredits page.

It seems like hard for me but i will try :)

HaCk CrAcK, May 31, 2011 at 10:52 PM
$arr['language'] = trim(strip_tags($this->match('/Language.?:<\/h4>(.*?)<\/div>/ms', $html, 1)));

That is not to learn the language of the movie?

I want to know how to get me back for information in Spanish

serhatyolacan, June 3, 2011 at 4:04 AM
Dear Abhinay,

Can you write example code or function for $/fullcredits page (with foreach loop for multiple names and for array) please.

Thank you!

serhatyolacan, June 11, 2011 at 7:33 AM
// Countries

$arr['country'] = array();
foreach($this->match_all('/(.*?)<\/a>/ms', $this->match('/Country.?:(.*?)(<\/div>|>.?and )/ms', $html, 1), 1) as $m)
{
array_push($arr['country'], $m);
}

Anonymous, June 23, 2011 at 11:41 PM
Does this script still work? I tried the example url and it doesn't return anything. I want to use this but also don't want to waste my time on something that doesn't work anymore.

Abhinay Rathore, June 24, 2011 at 7:14 AM
Anonymous, this script does work on my side. Try using the Bing search function instead of the Google one.

serhatyolacan, June 27, 2011 at 2:18 PM
Hello Abhinay. Cast and media images not working. Please check.

Rodney, June 27, 2011 at 2:41 PM
How hard would it be to add in the link for the movie trailer? I've been fiddling with it but can't quite get it.

serhatyolacan, June 27, 2011 at 2:47 PM
Updated to 2.3 and started to work again. Sorry for bothering.

Anonymous, July 2, 2011 at 4:07 AM
Hello,

since an Update (most likely yesterday) the IMDB Rating is not grabbed correctly anymore. Can you look after it?

Thanks!

Abhinay Rathore, July 2, 2011 at 6:35 AM
Thanks for the tip Anonymous, I've fixed the rating issue.

Anonymous, July 2, 2011 at 9:34 AM
Hey Abhinay, had the same RegEx, but couldn't post it in here for the tags. Will you maintain this scraper the next month? I'd like to rely on it for a project :-)

If so, please please keep the API for $scraper->scrapMovieInfo consistent. I grab the HTML with my own code.

BTW: Nice work. It's certainly no fun to write that many regexs.

serhatyolacan, July 4, 2011 at 5:30 PM
$arr['rating'] = $this->match('/itemprop="ratingValue">([0-9].[0-9])<\/span>/ms', $html, 1);

Anonymous, July 14, 2011 at 2:21 AM
Can you help me to generate the same XML file as you generated. I needed that code for my project www.flickonline.com. Thanks a bunch.

Abhinay Rathore, July 14, 2011 at 2:43 PM
Anonymous, you can get the IMDb Web Service API code at: http://lab.abhinayrathore.com/imdb/imdbWebService.htm
AnonymousJuly 17, 2011 at 4:09 AM
Hey, wow, the script works perfectly. I only have problems with two things.
If I type this, I only get "Array" in my PHP file:
< div class="cast">

< div class="genres">

Everything else, like $movieArray['rating'], votes or title, works perfectly.

Please help me with this problem :)

Greets Daniel
Abhinay RathoreJuly 17, 2011 at 8:19 AM
Daniel, for converting arrays to comma delimited string, use this statement:
implode(",", $movieArray['cast'])
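To make the point above concrete, here is a minimal sketch. The scraper returns list fields such as cast and genres as PHP arrays, and echoing an array directly prints only the word "Array". The `$movieArray` below is hand-written sample data standing in for the scraper's result, not real scraper output.

```php
<?php
// Sample data standing in for the scraper's return value (an assumption for illustration).
$movieArray = [
    'title'  => 'The Godfather',
    'cast'   => ['Marlon Brando', 'Al Pacino', 'James Caan'],
    'genres' => ['Crime', 'Drama'],
];

// implode() joins the array elements into one string using the given separator.
$castLine   = implode(', ', $movieArray['cast']);
$genresLine = implode(', ', $movieArray['genres']);

echo $castLine, PHP_EOL;   // Marlon Brando, Al Pacino, James Caan
echo $genresLine, PHP_EOL; // Crime, Drama
```

Any separator works; pass `"<br>"` instead of `', '` to put each name on its own line in HTML output.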
AnonymousJuly 17, 2011 at 10:23 AM
Hey Abhinay Rathore

Wow, it works perfectly now with the implode.
Big thanks for the help and the script.

Greets Daniel
Pedro AJJuly 21, 2011 at 11:14 AM
Hi Abhinay

I've been using your script for a while and everything was fine, but one or two months ago the movie poster stopped downloading. I've tried using your php proxy but to no avail. Could IMDB be blocking my website? Everything works fine from localhost. :(

Over
Abhinay RathoreJuly 23, 2011 at 8:42 AM
Pedro, it's possible that IMDb is monitoring leechers pretty heavily.
What you can try is spoofing the user agent in your cURL request. You can search Google for ways of doing it.
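A rough sketch of that suggestion, assuming the PHP cURL extension is installed. The function name `fetchWithUserAgent` and the user agent string are placeholders chosen for illustration and are not part of the scraper; whether a different user agent actually resolves the blocking is not confirmed here.

```php
<?php
// Hypothetical helper: fetch a URL while sending a browser-like User-Agent header.
function fetchWithUserAgent(string $url, string $userAgent): ?string
{
    $ch = curl_init($url);
    if ($ch === false) {
        return null;
    }
    curl_setopt_array($ch, [
        CURLOPT_RETURNTRANSFER => true,       // return the body instead of printing it
        CURLOPT_FOLLOWLOCATION => true,       // follow HTTP redirects
        CURLOPT_USERAGENT      => $userAgent, // the header the remote site sees
        CURLOPT_TIMEOUT        => 15,         // give up after 15 seconds
    ]);
    $body = curl_exec($ch);

    return $body === false ? null : $body;
}

// Example call (placeholder URL and user agent string):
// $html = fetchWithUserAgent('http://www.imdb.com/title/tt0068646/', 'Mozilla/5.0 (compatible; ExampleBot/1.0)');
```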
gidizJuly 25, 2011 at 3:53 PM
I haven't read all the comments, but one way to get the original title for non-US users is:

$arr['original_title'] = trim($this->match('/class="title-extra">(.*?)</ms', $html, 1));

Works for me.

Great class btw!
AnonymousAugust 8, 2011 at 8:23 AM
FYI, 'Votes' doesn't work anymore.

Thanks for a good script!
Abhinay RathoreAugust 8, 2011 at 1:37 PM
Votes issue fixed!
serhatyolacanAugust 10, 2011 at 4:46 PM
I made some improvements. Here is my edited scraper:

http://pastebin.com/4Bef6m1R

If I made a mistake, please poke me :)
Abhinay RathoreAugust 10, 2011 at 5:06 PM
Thanks a lot serhatyolacan,
I've included a couple of additions from your script, but I did not include the full credits as it contains some unwanted information and fetching one more URL would slow down the scraping a bit. But anyone who needs all that info can definitely profit from it :)
WouterAugust 15, 2011 at 2:47 PM
Hi Abhinay,
Great script. Got it working right out of the box. Thanks a lot. I do, however, have one problem: there are some movies that your script won't scrape. They also won't work on your demo page.

I only have this issue with 2 movies:
tt0458339 : Captain America: The First Avenger
tt1201607 : Harry Potter and the Deathly Hallows: Part 2

Perhaps I'm overlooking something, but I can't find what is going wrong. I tried both Google and Bing search. PHP is up to date. All other movies work fine; only these 2 won't.

Can you help me please.

Regards,
Wouter
Abhinay RathoreAugust 15, 2011 at 3:33 PM
Wouter, I don't see any problem with these titles. They are working just fine here.
GajanAugust 19, 2011 at 6:40 AM
Hi Abhinay,
I have the same problem with these two titles:
tt0458339 : Captain America: The First Avenger
tt1201607 : Harry Potter and the Deathly Hallows: Part 2

It only happens when I search by the IMDb ID, e.g. tt0458339.

Regards,
Gajan
Abhinay RathoreAugust 19, 2011 at 10:54 AM
Gajan and Wouter,
The problem with searching using certain title IDs is fixed now. Instead of searching for a complete URL match, it now searches only for IMDb title IDs, so it can capture all alternate URLs for a title ID search.
Also, I added a new function "getMovieInfoById" to get results directly from IMDb if you know the ID.
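One way to picture the fix described above: rather than matching a complete URL, pull just the title ID out of whatever URL variant the search returns. This is a sketch and not the scraper's actual code; `extractImdbId` is a hypothetical helper, and the pattern assumes IDs of the form `tt` followed by at least seven digits, as in the IDs quoted in this thread.

```php
<?php
// Hypothetical helper: return the first IMDb title ID found in a string, or null.
function extractImdbId(string $text): ?string
{
    // "tt" followed by seven or more digits, wherever it appears in the text.
    if (preg_match('/tt\d{7,}/', $text, $m) === 1) {
        return $m[0];
    }
    return null;
}

echo extractImdbId('http://www.imdb.com/title/tt0458339/'), PHP_EOL;          // tt0458339
echo extractImdbId('http://akas.imdb.com/title/tt1201607/combined'), PHP_EOL; // tt1201607
```

Because only the ID is matched, `www.imdb.com`, `akas.imdb.com`, and URLs with extra path segments all resolve to the same title.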
AnonymousAugust 22, 2011 at 1:06 PM
@Abhinay, hello sir,

I have a beginner question.

I want to print out the genres, but when I use
echo $movieArray['genres'].'
;

it gives me the text result 'Array', and this is all I get.

I saw your previous replies about implode, but I don't know how to use it. Can you please give me a PHP line which will echo the genres?

Thank you.
Abhinay RathoreAugust 22, 2011 at 2:32 PM
Anonymous,
Check the usage example file link above.
For your particular problem:
echo implode(", ", $movieArray['genres']);
AnonymousAugust 22, 2011 at 4:09 PM
Thank you, Abhinay Rathore. echo implode(", ", $movieArray['genres']); worked brilliantly.

I will post here when I finish the project, maybe you will like the result.
serhatyolacanAugust 23, 2011 at 9:46 AM
Hello Ahmed. This function allows full movie url or title id. You will like it :)

http://pastebin.com/sYmiPayy
Sumit DebSeptember 5, 2011 at 4:32 AM
Hello Abhinay.
I am a student currently pursuing a BTech in CSE.
I am working on a "web crawling and scraping in PHP" project and would like to have your valuable help and suggestions on it. I would be grateful if you could mail me at
sumitdeb1001@gmail.com
Thanks in advance...
AnonymousSeptember 11, 2011 at 1:46 PM
Small fix:
$url= "http://www.google.com/search?q=on+site:imdb.com+" . rawurlencode($title);
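As a small illustration of what that line does, the sketch below wraps it in a function. `buildImdbSearchUrl` is a hypothetical name; the query prefix is copied from the comment above as-is.

```php
<?php
// Hypothetical wrapper around the one-line fix above.
function buildImdbSearchUrl(string $title): string
{
    // rawurlencode() percent-encodes spaces and reserved characters,
    // so a space becomes %20 and an ampersand becomes %26.
    return 'http://www.google.com/search?q=on+site:imdb.com+' . rawurlencode($title);
}

echo buildImdbSearchUrl('The Godfather'), PHP_EOL;
// http://www.google.com/search?q=on+site:imdb.com+The%20Godfather
```

Encoding the title this way keeps characters such as `&` or `#` in a movie name from being read as part of the URL's own syntax.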
AmirReza MohammadiSeptember 17, 2011 at 1:57 AM
Hello Abhinay,

I need your help :( Can you help me? I have two pages.
The first page has a text field and a submit button, and the second page shows the movie title, movie rating, movie plot, and so on.

How can I use your code so that when I type an IMDb link on the first page and click the button, the title, rating, and so on are shown on the second page?

PS: I started learning PHP a few months ago.
AmirReza MohammadiSeptember 17, 2011 at 3:37 AM
Dear Abhinay, please approve my previous comment and help me :( Thanks.
Abhinay RathoreSeptember 17, 2011 at 8:43 AM
AmirReza, please go ahead and learn form handling in PHP. That'll help you with this and similar problems in future.
Some examples: http://www.w3schools.com/php/php_forms.asp and http://www.tizag.com/phpT/examples/formex.php
AnonymousSeptember 21, 2011 at 2:26 AM
FYI, 'Votes' doesn't work anymore.
Abhinay RathoreSeptember 21, 2011 at 10:39 AM
Fixed Votes and Plot issue!
AnonymousSeptember 24, 2011 at 5:48 PM
Here is my result partially based on your script, http://plusimdb.com
Thank you a lot.
AnonymousSeptember 24, 2011 at 6:01 PM
Just a suggestion: you could add a tag or a line to your post, something like:

Last update: Day-Month-Year

This way we will know when you update anything, since you keep the same version number with every modification.
AnonymousSeptember 27, 2011 at 5:16 PM
Great work, man. Thanks for the latest (Sept 22, 2011) fix.
AnonymousOctober 3, 2011 at 4:36 AM
Hi. How can I grab the trailer link with this?

It's working great anyway :D
AnonymousOctober 4, 2011 at 3:11 PM
If someone needs the trivia for a movie, here it is:

$arr['trivia'] = trim(strip_tags($this->match('/Trivia<\/h4>(.*?)(|<span)/ms', $html, 1)));
AnonymousOctober 6, 2011 at 3:44 AM
here's my version

https://github.com/Islander/Isy_IMDB

cheers abhinay :)
web design companyOctober 13, 2011 at 5:57 AM
Hi Abhi, I would like to tell you that I have learned a lot from this. That was really a great script.
M2HOctober 13, 2011 at 9:32 AM
Hi,
Can I search by title ID? How?
Please give one code sample.
Search by title name:
http://lab.abhinayrathore.com/imdb/imdbWebService.php?m=Titanic&o=xml

Search by title ID: ???????
Please help, thanks.
Regards
Abhinay RathoreOctober 13, 2011 at 9:35 AM
M2H, this is a pretty versatile API, so you can put the title ID in place of the movie name:
http://lab.abhinayrathore.com/imdb/imdbWebService.php?m=tt1234567&o=xml
M2HOctober 13, 2011 at 12:16 PM
@Abhinay Rathore:
Very, very good.
Thanks a lot.
Kind regards.
Thanks again :)
sjOctober 15, 2011 at 2:00 AM
I saved your file as imdb.php, and then I created test.php:
getMovieInfo("The Godfather");
echo $imdb->arr['title'];
?>

Please help me, anyone. It's not working.
Abhinay RathoreOctober 15, 2011 at 9:14 AM
sj, please refer to this: http://lab.abhinayrathore.com/imdb/usage.htm
AnonymousOctober 26, 2011 at 10:46 AM
It's a great piece of code. One question: where do you get the autocomplete results from? Can you post a suggest.php sample?
AnonymousOctober 28, 2011 at 5:43 PM
I've been able to pick up results from Google Suggest, but not from IMDb itself.
Abhinay RathoreOctober 28, 2011 at 11:05 PM
Anonymous, I am working on an easy to implement code for pulling search suggestions from IMDb. I'll post the code on this blog pretty soon :) Stay tuned!
Abhinay RathoreOctober 30, 2011 at 11:12 AM
Anonymous, you can find the IMDb search suggestions API here: http://web3o.blogspot.com/2011/10/imdb-search-suggestions-with-jquery.html
AnonymousNovember 2, 2011 at 7:56 AM
Can you please provide the regular expression for filming locations?
JonnyNovember 3, 2011 at 12:28 PM
Hey Abhinay, this is a really great script! It really helps me out :)

Is there a way to input multiple imdb urls and then get the data spit out into columns rather than rows?

Also, how can I get it to separate cast members with a comma rather than a new line?

Thanks!
MarkNovember 13, 2011 at 12:04 PM
Is it possible to use the scraper to get the Top 250, the top action movies, or the top movies of the 1990s?
VincentNovember 19, 2011 at 5:34 AM
Hello,
Thanks for this work.
I use this scraper in my free software, xbne (http://passion-xbmc.org/downloads/?sa=view;id=23).
- Is it possible to add the trailer?
- Is it possible to add a tag for:
1) Event images for advertising pictures. Ex :(<a title="Mickey Rourke at event...)
2) Thumbs format (Height > Width)
3) Fanart format (Width > Height)

Thanks..
Vincent
AnonymousDecember 6, 2011 at 8:40 AM
Hi, how do I display the top 10 releases from IMDb? By the way, nice and useful script; the only problem is that you have to specify an IMDb ID for it to work :) I would like to be able to get the top 10 box office movies in the UK. Can somebody please explain how it can be done? I am no good with object-oriented programming :) Thanks.
AlejandroFebruary 7, 2012 at 3:08 PM
My request to google is being blocked.
3cinemaFebruary 20, 2012 at 2:47 PM
1000 times thankyou. This is awesome and works great.

All hail Rathore for he has given the gift of awesomeness!
AnonymousFebruary 22, 2012 at 1:30 AM
Here is what I have achieved http://demo.plusimdb.com
JasonMarch 16, 2012 at 11:33 PM
As of this morning, it seems that the MPAA rating is broken again. Any help would be appreciated; I can't seem to figure it out.

Oh and this script is amazing.
JasonApril 2, 2012 at 12:13 PM
MPAA is still broken in the script but I was able to get it working. Thanks again for the great tool!
Abhinay RathoreApril 2, 2012 at 1:58 PM
Fixed MPAA rating issue.
AnonymousMay 15, 2012 at 11:25 AM
Abhinay, thanks for making this freely available. Not only has your API been very reliably effective for my project, but I'm also just learning PHP, so have benefitted greatly from being able to look through how your API works. Thanks!
Mostafa MirbabaieMay 27, 2012 at 6:47 AM
Dear Abhinay

Is your movie catalog script open source? Can I have this script?
I have many movies and have used several different programs to manage the list, but your script is excellent and very useful.

Thanks
AnonymousMay 28, 2012 at 4:50 PM
I just noticed there is an updated script, but with either version I get "no movie found" using google.com, google.co.uk, or bing.com.

I am only scraping the IMDb number and the rating (to compare against my own).

Any ideas on a fix?

Thanks
AdminJune 8, 2012 at 1:19 PM
Hey, can I include this script in a commercial project? Donation to you will be given of course :)
UnknownJune 25, 2012 at 7:24 AM
The script is very good. How do I also get the user reviews?
AnonymousJuly 8, 2012 at 1:19 AM
Hi!
Is there a way to integrate it into WordPress?
That would be great.

Thanks!
AnonymousJuly 15, 2012 at 10:14 AM
Hi, could you please provide full cast and crew information?
And also a modified scraper for people?
kaosnewsAugust 15, 2012 at 6:51 AM
Hi,

I think IMDB have changed something on their site. If there is no poster available it shows the wrong output. For example 'Pepsi Smash' - if you enter this on your demo page it shows facebook images.
AnonymousAugust 15, 2012 at 11:20 AM
Yep, IMDb changed something, because some of the posters can't be retrieved anymore. Any updates on this? Thank you.
AnonymousAugust 19, 2012 at 7:00 AM
The Release Date is apparently taken from the main page, which may display the release date in another country instead of the original release date.
Example: http://akas.imdb.com/title/tt0219400/ shows the Turkey release date on the main page.

I guess the Release Date would have to be calculated as the earliest date from the releaseinfo page.
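The idea in the comment above could look roughly like this. It is only a sketch: `earliestReleaseDate` is a hypothetical helper, and it assumes the dates have already been scraped from the releaseinfo page into plain strings such as "16 November 2000" (the sample values below are made up).

```php
<?php
// Hypothetical helper: pick the earliest date from a list of date strings.
function earliestReleaseDate(array $dates): ?string
{
    $timestamps = [];
    foreach ($dates as $date) {
        $ts = strtotime(trim($date));
        if ($ts !== false) {       // skip anything strtotime() cannot parse
            $timestamps[] = $ts;
        }
    }
    // Return the earliest date as YYYY-MM-DD, or null if nothing could be parsed.
    return $timestamps ? date('Y-m-d', min($timestamps)) : null;
}

$sample = ['5 January 2001', '16 November 2000', ''];
echo earliestReleaseDate($sample), PHP_EOL; // 2000-11-16
```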
kaosnewsAugust 20, 2012 at 2:12 AM
I have also encountered another 'small' thing. It now also grabs the line 'See full cast and crew' along with the stars. This is new as well. So there is the poster problem and this one.
AnonymousSeptember 19, 2012 at 8:02 AM
Hi!

Please help...

I only need the media images. How can I retrieve them? Maybe imdbimage.php could be used.

thanks

Best regards

Gergő
Reco-XOctober 1, 2012 at 6:28 PM
How can I use it if I have a database with 15,000 movies, each with its IMDb link, and I want to copy the data into my own database?

And if I want to add a new movie, how can I enter only the movie's URL and have this script scrape all the data and save it into the database?
MalcolmNovember 13, 2012 at 5:57 PM
Hi. I am using the imdb.php file and the test file you provide for demonstration, but it returns an error no matter what movie I search for. It seems the error is happening in the "geturl" function. Can anyone help me figure this out? I am only an intermediate at best with PHP, so please be patient with me! :)
JoanNovember 19, 2012 at 3:58 AM
Awesome script, thank you!!

Why not use the "combined" page (http://akas.imdb.com/title/tt0120338/combined)?

It has important information like:

Original Music
Cinematography
Film Editing
Production Companies

Regards!
Dna75December 3, 2012 at 1:25 PM
It was working perfectly, but a few days ago the poster image URLs stopped working.

Is there a fix for this?
AnonymousDecember 28, 2012 at 6:37 AM
Dear Abhinay Rathore,
first of all, thank you so much for your script. It works & it is flexible.

I need your help. Can you please add info for whether the title is a movie or documentary or tv series?

thanks in advance, BR

CAGRI
AnonymousDecember 29, 2012 at 12:41 PM
Dear Abhinay Rathore and other friends,

Why can't I get a "-" with the code below, even when a movie doesn't have an original title on IMDb? What am I doing wrong? Thanks in advance, BR
CAGRI

if (!is_null(trim($movieArray['original_title']))) {echo $movieArray['original_title'];} else {echo '-';}
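A likely cause, sketched below: `trim()` always returns a string, so `is_null(trim(...))` is never true and the `else` branch never runs. The sketch assumes the scraper leaves `original_title` as an empty string, null, or unset when there is no original title, which is not confirmed in this thread; `displayOriginalTitle` is a hypothetical helper name.

```php
<?php
// Hypothetical helper: return the original title, or "-" when there is none.
function displayOriginalTitle(array $movieArray): string
{
    // Treat a missing key or null as an empty string, then strip whitespace.
    $original = trim((string) ($movieArray['original_title'] ?? ''));

    // Compare against the empty string instead of using is_null().
    return $original !== '' ? $original : '-';
}

echo displayOriginalTitle(['original_title' => "L'amour en fuite"]), PHP_EOL; // L'amour en fuite
echo displayOriginalTitle(['original_title' => '']), PHP_EOL;                 // -
```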
AnonymousDecember 30, 2012 at 4:33 AM
Dear Abhinay Rathore and other friends,

For the movie tt0078771 (IMDb title: Love on the Run), which is a French movie, the original title is L'amour en fuite.

I use the getMovieInfoById function.

The strange thing I noticed is:
I get tt0078771's IMDb title as the original title, and the original title as the IMDb title. I also checked the IMDb page (everything seems OK). What could be the reason the titles are swapped for this movie?

thanks in advance, BR
CAGRI
AnonymousDecember 30, 2012 at 5:36 AM
Update note: to also output a single win and/or a single nomination,

I changed these lines and it works:
$arr['awards'] = trim($this->match('/(\d+) win/ms',$html, 1));

$arr['nominations'] = trim($this->match('/(\d+) nomination/ms',$html, 1));

What I changed:
"wins" to "win"
"nominations" to "nomination"

Try it with tt0115561. It has only 1 win.

BR
CAGRI
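The change above works because "win" is a prefix of "wins". A slightly more explicit variant, shown here as a standalone sketch rather than as the scraper's code, makes the plural "s" optional with `s?`. `parseAwardCounts` is a hypothetical helper and the sample strings in the usage lines are made up.

```php
<?php
// Hypothetical helper: extract win and nomination counts from a text snippet.
function parseAwardCounts(string $html): array
{
    // "s?" makes the plural optional, so "1 win" and "12 wins" both match.
    $wins = preg_match('/(\d+) wins?\b/', $html, $m) === 1 ? (int) $m[1] : 0;
    $nominations = preg_match('/(\d+) nominations?\b/', $html, $m) === 1 ? (int) $m[1] : 0;

    return ['wins' => $wins, 'nominations' => $nominations];
}

print_r(parseAwardCounts('1 win'));                   // wins = 1, nominations = 0
print_r(parseAwardCounts('12 wins & 3 nominations')); // wins = 12, nominations = 3
```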
AnonymousDecember 30, 2012 at 5:40 AM
Dear Abhinay Rathore and other friends,

Update request:
If the movie won only 1 Oscar, the script outputs an empty value. Changing "oscars" to "oscar" has no effect because IMDb writes "Won Oscar." when there is only 1 Oscar. Since no numeric value is present, the result comes back empty.

Can you please fix this issue?

thanks in advance, BR
CAGRI
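One possible way to handle the case described above is to make the number optional and default to 1 when it is absent. This is a sketch only: `parseOscarCount` is a hypothetical helper, "Won Oscar." is the wording reported in the comment, and the plural form "Won 3 Oscars." is an assumption about how IMDb words multiple wins.

```php
<?php
// Hypothetical helper: count Oscars from a "Won ... Oscar(s)." phrase.
function parseOscarCount(string $html): int
{
    // The "(\d+) " part is optional, so both "Won Oscar." and "Won 3 Oscars." match.
    if (preg_match('/Won (?:(\d+) )?Oscars?\./', $html, $m) === 1) {
        // No captured number means the singular wording, which counts as one.
        return isset($m[1]) && $m[1] !== '' ? (int) $m[1] : 1;
    }
    return 0; // no Oscar phrase found
}

echo parseOscarCount('Won Oscar.'), PHP_EOL;    // 1
echo parseOscarCount('Won 3 Oscars.'), PHP_EOL; // 3
```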
JoanJanuary 4, 2013 at 3:58 AM
Hi Abhinay,

If you update the script this year, please consider using the "combined" page (http://www.imdb.com/title/tt0088247/combined).

I want to include some important information like:

Original Music
Cinematography
Film Editing
Production Companies

I tried to implement it myself with no luck; I'm very bad at scraping.

Regards and happy new year!
AnonymousJanuary 11, 2013 at 9:18 AM
There is still something wrong with retrieving posters. It's not a mistake in the script, but something on the IMDb website. For example, if you search for Men Are Such Fools (tt0030433) you will get this URL: http://lab.abhinayrathore.com/imdb/imdbImage.php?url=http://b.scorecardresearch.com/p?c1=2&c2=6034961&c3=&c4=http%3A%2F%2Fwww.imdb.com%2Ftitle%2Ftt0030433%2F&c5=c6=&15=&cj=1
AnonymousJanuary 14, 2013 at 11:37 PM
Looks like the script got broken by updates to IMDb. Any plans to update?