If I try to access up to 1001 page or greater in "discover/movie" method I get an error "page must be less than or equal to 1000". if I use the "movie" method then I can iterate where I need to, but I can’t apply a filter to this method and it also produces a lot of deleted movies. What to do then?
Can't find a movie or TV show? Login to create it.
Want to rate or add this item to a list?
Not a member?
Reply by Travis Bell
on November 25, 2018 at 11:37 AM
Most of our list methods (search, discover, popular, etc...) are capped to the first 20,000 items for performance reasons. Elasticsearch struggles with this and so it is restricted in code. Newer versions of ES have improved this a lot but we're still on quite an old version.
The advice I have given people in the past is that if they need data on more than 20,000 items like this, it's probably better to bring the data in house and then you can run whatever queries/filters you want locally. Ie. by iterating over all of the available media ids.
I don't understand this comment, can you elaborate? Do you mean that by iterating over the ids that you get a lot of 404's? If so, that's exactly why the daily ID exports exist. Grab a recent file and iterate over these ids instead, while there could be a few 404's (those deleted since the file was created) you're basically going to be 404 free.
Reply by editbatic
on November 25, 2018 at 12:12 PM
Yes, 404. How to open json.gz file?
Reply by Travis Bell
on November 25, 2018 at 1:03 PM
Not sure what platform you're on, best to just Google how to extract a .gz file. It's a gzipped file.