8,231
edits
Line 49: | Line 49: | ||
==== Downloading subtitles ==== | ==== Downloading subtitles ==== | ||
The subtitles of videos are harder to get. They are all available on [http://www.amara.org/en/teams/ted/videos/ here] in multiple formats. We will use the caption format SRT. | The subtitles of videos are harder to get. They are all available on [http://www.amara.org/en/teams/ted/videos/ here] in multiple formats. We will use the caption format SRT. | ||
==== Building HTML sites out of the scraped content ==== | ==== Building HTML sites out of the scraped content ==== | ||
We want to 'export' our scraped data to html, so we can run the zim tool on it and create compressed zim files off it. | We want to 'export' our scraped data to html, so we can run the zim tool on it and create compressed zim files off it. Out of all the possibilities [http://jinja.pocoo.org/docs/ Jinja2] seems to be the best library for that. | ||
Out of all the possibilities [http://jinja.pocoo.org/docs/ Jinja2] seems to be the best library for that. | |||
==== Javascript client side filter/search solution === | ==== Javascript client side filter/search solution ==== | ||
... | ... | ||
==== Templating solution to create pages ==== | ==== Templating solution to create pages ==== |
edits