8,230
edits
(→Goals) |
(→Goals) |
||
(3 intermediate revisions by the same user not shown) | |||
Line 18: | Line 18: | ||
We need to (does not have to be in this order): | We need to (does not have to be in this order): | ||
* Assess current situation | * Assess current situation | ||
** Present Webrecorder/Kiwix current activities and projects | ** <s>Present Webrecorder/Kiwix current activities and projects</s> Ilya not available | ||
** Present current sofware stack and how it interacts together | ** Present current sofware stack and how it interacts together | ||
** List and identify the weaknesses (at least the one not clearly identify already) in the current architecture/software | ** List and identify the weaknesses (at least the one not clearly identify already) in the current architecture/software | ||
Line 24: | Line 24: | ||
**Go over the crawler's CLI params to understand how/when to use them (<code>docker run --rm -it ghcr.io/openzim/zimit:dev crawl --help</code>) | **Go over the crawler's CLI params to understand how/when to use them (<code>docker run --rm -it ghcr.io/openzim/zimit:dev crawl --help</code>) | ||
**<s>Status of <bdi>[https://github.com/webrecorder/browsertrix-crawler/issues/207 Success status code on failure]</bdi></s> | **<s>Status of <bdi>[https://github.com/webrecorder/browsertrix-crawler/issues/207 Success status code on failure]</bdi></s> | ||
**Status of [https://github.com/webrecorder/browsertrix-crawler/issues/246 Disable browser updates] | **<s>Status of [https://github.com/webrecorder/browsertrix-crawler/issues/246 Disable browser updates]</s> Fixed in Zimit, but not yet upstream in Browsertrix | ||
**Status of [https://github.com/webrecorder/browsertrix-crawler/issues/159 SSLError] | **<s>Status of [https://github.com/webrecorder/browsertrix-crawler/issues/159 SSLError]</s> | ||
**[https://github.com/openzim/warc2zim/issues/109 First access to warc2zim file doesn't correctly catch external links] | **[https://github.com/openzim/warc2zim/issues/109 First access to warc2zim file doesn't correctly catch external links] | ||
Line 32: | Line 32: | ||
** [https://github.com/openzim/warc2zim/issues/65 How communicate to a user the boundaries of a ZIM?] | ** [https://github.com/openzim/warc2zim/issues/65 How communicate to a user the boundaries of a ZIM?] | ||
** [https://github.com/openzim/zimit/issues/126 Should we still use Service workers?] | ** [https://github.com/openzim/zimit/issues/126 Should we still use Service workers?] | ||
** [https://github.com/openzim/warc2zim/issues/72 What kind of size optimisation should we run?] | ** <s>[https://github.com/openzim/warc2zim/issues/72 What kind of size optimisation should we run?]</s> WONTFIX | ||
** [https://github.com/openzim/warc2zim/issues/104 Assess pseudo namespaces] | ** [https://github.com/openzim/warc2zim/issues/104 Assess pseudo namespaces] | ||
**<bdi>[https://github.com/openzim/zimit/issues/166 Should we accept invalid HTTPs?]</bdi> | **<bdi>[https://github.com/openzim/zimit/issues/166 Should we accept invalid HTTPs?]</bdi> |
edits