Open-Assistant/notebooks/data-augmentation/movie-descriptions at main · panda2012/Open-Assistant

History

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
movie_descriptions.ipynb		movie_descriptions.ipynb

README.md

Dataset Summary

This dataset is created by scraping Letterbox (popular film titles) + Wikipedia (film descriptions). This is because the descriptions in Letterbox are from The Movie Database, whose terms prevent the use of their data: https://www.themoviedb.org/terms-of-use.

The dataset format is:

INSTRUCTION: Write a description about the film {film}.
RESPONSE: In his second year of fighting crime, Batman uncovers corruption in Gotham City...

The notebook only contains the instructions to obtain the dataset, other steps must be implemented. The process can take hours or days, so I recommend setting a limit on the number of data to be obtained.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

movie-descriptions

movie-descriptions

README.md

Dataset Summary

Files

movie-descriptions

Directory actions

More options

Directory actions

More options

Latest commit

History

movie-descriptions

Folders and files

parent directory

README.md

Dataset Summary