Difference between revisions of "Mediawiki DumpHTML extension improvement"

Jump to navigation Jump to search
no edit summary
Line 16: Line 16:


== Challenges ==
== Challenges ==
Consequently, almost nobody uses it right now to generate ZIM files, this is too complicated and buggy. This, although a lot of people want to do that and contact the Kiwix dev. Team to help them to make a ZIM of their own content. But not only external projects would benefit from such a development, we would also gain a lot in efficiency and this would be the first mandatory step to prepare automatically ZIM files.
Consequently, almost nobody uses it right now to generate ZIM files, this is too complicated and buggy. This, although a lot of people want to do that and contact the Kiwix dev. Team to help them to make a ZIM of their own content.  
 
With this project we to offer:
* A way for everyone to generate ZIM files with Mediawiki
* the simpliest solution to use for end user
* the simpliest solution to use and deploy for Mediawiki administrators
* the solution offering the best HTML rendering quality
* the best designed solution simply to maintain and easy to improve
* the most performat solution in term of generating speed


== Workpackages ==
== Workpackages ==
=== Workpackage1: Revamping and fixing bugs ===
 
=== 1 - Revamping and fixing bugs ===


The worth point is that the DumpHTML extension is not correctly maintained and with the time, [https://bugzilla.wikimedia.org/buglist.cgi?query_format=advanced&list_id=2671&component=DumpHTML&resolution=---&product=MediaWiki%20extensions many issues were discovered]. Currently, the extension is not really usable without fixing/tweaking the Mediawiki code.
The worth point is that the DumpHTML extension is not correctly maintained and with the time, [https://bugzilla.wikimedia.org/buglist.cgi?query_format=advanced&list_id=2671&component=DumpHTML&resolution=---&product=MediaWiki%20extensions many issues were discovered]. Currently, the extension is not really usable without fixing/tweaking the Mediawiki code.
Line 33: Line 42:
* ~ 4000 euros
* ~ 4000 euros


=== Workpackage2: phpzim creation an integration in DumpHTML extension ===
=== 2 - phpzim creation an integration in DumpHTML extension ===


phpzim would be a new php module allowing to create/write and read ZIM file directly in PHP. This would be a binding of the zimlib, like pyzim in Python. With this library done, we will be able to create ZIM file directly from the DumpHTML.
phpzim would be a new php module allowing to create/write and read ZIM file directly in PHP. This would be a binding of the zimlib, like pyzim in Python. With this library done, we will be able to create ZIM file directly from the DumpHTML.

Navigation menu