Difference between revisions of "Talk:Mediawiki DumpHTML extension improvement"

Jump to navigation Jump to search
no edit summary
Line 12: Line 12:


: I never consider tex renderer equations as "images", they should and will be always there. This is easy to achieve with DumpHTML extension, because handlings (equations vs images) are different in the php code. I also do not think this should increase a lot of the ZIM file size at the end. We do not have any issue here I think. [[User:Kelson|Kelson]] 19:17, 31 August 2011 (CEST)
: I never consider tex renderer equations as "images", they should and will be always there. This is easy to achieve with DumpHTML extension, because handlings (equations vs images) are different in the php code. I also do not think this should increase a lot of the ZIM file size at the end. We do not have any issue here I think. [[User:Kelson|Kelson]] 19:17, 31 August 2011 (CEST)
:: Good to know. Actually it was a bug in my app which led to omission of equation images. The old german wikipedia on openzim.org contains equations. It has a little less than 1000000 and its size is only 1.5GB. So I'd also expect that equations don't increase ZIM file size a lot. --[[User:Cip|Cip]] 01:39, 3 September 2011 (CEST)


Independently, it definitely also make sense (not only for mobile use case) to also have additionally zim files with a small selection of images only. I am aware that this
Independently, it definitely also make sense (not only for mobile use case) to also have additionally zim files with a small selection of images only. I am aware that this
Line 18: Line 20:


: Fully agree with you (but only images, no equations). An algorithm to identify "important" pictures and sort them would be great. If you have time to work on that this would be really great. I'm also ready to help you to test the results with test ZIM files. This work should also IMO be granted. [[User:Kelson|Kelson]] 19:17, 31 August 2011 (CEST)
: Fully agree with you (but only images, no equations). An algorithm to identify "important" pictures and sort them would be great. If you have time to work on that this would be really great. I'm also ready to help you to test the results with test ZIM files. This work should also IMO be granted. [[User:Kelson|Kelson]] 19:17, 31 August 2011 (CEST)
:: I'm sorry but I fear that I won't have time to work on this. --[[User:Cip|Cip]] 01:39, 3 September 2011 (CEST)


* Split
* Split
Line 30: Line 34:


: My opinon (1) splited versions of each content should be alway provided (2) We should find a solution if the file system where files are saved is not able to deal with big file to detect it automaticaly (3) Version for mobile should be provided beside the other ones (4) reader/kiwix should be able to know if he needs a version for mobile or not (5) download per default an adequat version (splited or not, mobile version or not). [[User:Kelson|Kelson]] 19:27, 31 August 2011 (CEST)
: My opinon (1) splited versions of each content should be alway provided (2) We should find a solution if the file system where files are saved is not able to deal with big file to detect it automaticaly (3) Version for mobile should be provided beside the other ones (4) reader/kiwix should be able to know if he needs a version for mobile or not (5) download per default an adequat version (splited or not, mobile version or not). [[User:Kelson|Kelson]] 19:27, 31 August 2011 (CEST)
:: I don't understand what you mean with "(2) We should find a solution if the file system where files are saved is not able to deal with big file to detect it automaticaly": If the app is doing the download there is no real benefit in downloading one large file instead of downloading multiple small files. Therefore I'd propose to keep it simple and just always download splitted files (2GB), independent of the target file system. (In particular as it could be pretty difficult to impossible to detect the file system on a mobile device). It may even make sense to also download 2GB-splits on desktop-kiwix as well (As the user can then copy the downloaded files easily to a mobile phone, although I agree that in the desktop-case having a single file also has it benefits.
:: Having additionally a single file is benefical if the user wants to download a zim file manually just using the webbrowser. While this is basically true for both desktop and mobile users, for mobile users the benefit may be pretty limited, as in this case typically smaller (no images) zim files, and so also less separate files need to be downloaded, and there is a pretty good chance that the user finds out only after downloading the large file that it won't work on his mobile.
:: ad "(3) Version for mobile should be provided beside the other ones". What exactly  do you mean with version for mobile? I was thinking that different versions (Like: No images, with image selection (as soon as algorithm is available ;), all images) are available, which can all be used both on desktop or mobile. For sure probably the majority of mobile users will use a non-image version while desktop users may prefer the all-image version, but in the end its up to the user what version she wants to use. --[[User:Cip|Cip]] 01:39, 3 September 2011 (CEST)


2. It is pretty complex to support such a feature in an app. (i.p. on multiple platforms) This may mean that apps just don't support this feature and let it up to the user to download the
2. It is pretty complex to support such a feature in an app. (i.p. on multiple platforms) This may mean that apps just don't support this feature and let it up to the user to download the
Line 37: Line 48:


: Separate download should always be possible. Detecting a mobile device is easy... in fact what matters is the screen resolution (we could introduce a meta-tag with something like minresolution inside). Detecting the nature of the filesystem should be also possible... although I do not know how? An idea? [[User:Kelson|Kelson]] 19:27, 31 August 2011 (CEST)
: Separate download should always be possible. Detecting a mobile device is easy... in fact what matters is the screen resolution (we could introduce a meta-tag with something like minresolution inside). Detecting the nature of the filesystem should be also possible... although I do not know how? An idea? [[User:Kelson|Kelson]] 19:27, 31 August 2011 (CEST)
:: Sorry, I don't get this. Are you thinking about the web page providing the zim downloads? --[[User:Cip|Cip]] 01:39, 3 September 2011 (CEST)
2

edits

Navigation menu