Difference between revisions of "TED"
Jump to navigation
Jump to search
Line 14: | Line 14: | ||
# Download videos and re-encode them if necessary | # Download videos and re-encode them if necessary | ||
# Retrieve the video subtitle files | # Retrieve the video subtitle files | ||
## Subtitle don't make so much sense for TEDx | |||
## TED has a translation program [http://www.ted.com/OpenTranslationProject here] | |||
# Create the necessary templates of the index web pages (For the search/filter feature, a javascript client side solution should be tried) | # Create the necessary templates of the index web pages (For the search/filter feature, a javascript client side solution should be tried) | ||
# Fill the HTML templates with the data from the XML/RDF and write the index pages in a target directory | # Fill the HTML templates with the data from the XML/RDF and write the index pages in a target directory | ||
# Run zimwriterfs to create the corresponding ZIM file of your target directory | # Run zimwriterfs to create the corresponding ZIM file of your target directory |
Revision as of 19:56, 16 February 2014
TED (Technology, Entertainment, Design) is a global set of conferences under the slogan "ideas worth spreading". They address a wide range of topics within the research and practice of science and culture, often through storytelling. The speakers are given a maximum of 18 minutes to present their ideas in the most innovative and engaging ways they can. Its web site is www.ted.com. The purpose of this project is to create a sustainable solution to create a ZIM file providing the TED and TEDx videos in a similar manner like ted.com
Goals
- A script (python) able to create easily following ZIM files of the TED and TEDx videos with the possibility to filter by language/conference/topic
- A list of TED talks can be found here
- The data should be scraped from ted.com.
- Videos should be available in HTML5 and subtitles need to be supported
- The ZIM should provide a simple filtering/search solution to find content (by author, language, title, conference, topic, ....)
One way to achieve it
- Retrieve the list of TED(x) presentations with medatas in a local database
- Download videos and re-encode them if necessary
- Retrieve the video subtitle files
- Subtitle don't make so much sense for TEDx
- TED has a translation program here
- Create the necessary templates of the index web pages (For the search/filter feature, a javascript client side solution should be tried)
- Fill the HTML templates with the data from the XML/RDF and write the index pages in a target directory
- Run zimwriterfs to create the corresponding ZIM file of your target directory