Difference between revisions of "TED"

Jump to navigation Jump to search
65 bytes added ,  10 years ago
 
(2 intermediate revisions by the same user not shown)
Line 39: Line 39:
** Creation of a concept including a conceptional Zim file, that demonstrates the very basics of this project {{done}}
** Creation of a concept including a conceptional Zim file, that demonstrates the very basics of this project {{done}}
* Rest of the first week (20. - 23.02.2104):
* Rest of the first week (20. - 23.02.2104):
** Collection of all the data
** Collection of all the data {{done}}
*** Writing the Scraper, that scrapes TED.com  
*** Writing the Scraper, that scrapes TED.com {{done}}
*** Writing the Scraper, that scrapes the TED translation page on ww.amara.org
*** Writing the Scraper, that scrapes the TED translation page on ww.amara.org {{done}}
*** Writing the html templates  
*** Writing the html templates {{done}}
*** Writing a python script, that dumps the scraped data into the HTML pages, creating static content
*** Writing a python script, that dumps the scraped data into the HTML pages, creating static content {{done}}
* First three days of the second week (24. - 26.02.2014):
* First three days of the second week (24. - 26.02.2014):
** Implementing the local database, that manages all the content  
** Implementing the local database, that manages all the content {{done}}
** Implementing the search engine in Javascript, that allows the user to search through all of the content
** Implementing the search engine in Javascript, that allows the user to search through all of the content {{done}}
** Finally: Creating the first prototype zim files
** Finally: Creating the first prototype zim files
* Rest of the second week (27.02 - 2.03.2014):
* Rest of the second week (27.02 - 2.03.2014):
Line 66: Line 66:


==== Downloading subtitles ====
==== Downloading subtitles ====
The subtitles of videos are harder to get. They are all available on [http://www.amara.org/en/teams/ted/videos/ here] in multiple formats. We will use the caption format SRT.
The subtitles of videos are harder to get. They are all available on [http://www.amara.org/en/teams/ted/videos/ here] in multiple formats. We will use the caption format WebVTT.


==== Building HTML sites out of the scraped content  ====
==== Building HTML sites out of the scraped content  ====
21

edits

Navigation menu