Difference between revisions of "Athens 2023"

Jump to navigation Jump to search
107 bytes added ,  11 months ago
(3 intermediate revisions by the same user not shown)
Line 18: Line 18:
We need to (does not have to be in this order):
We need to (does not have to be in this order):
* Assess current situation
* Assess current situation
** Present Webrecorder/Kiwix current activities and projects
** <s>Present Webrecorder/Kiwix current activities and projects</s> Ilya not available
** Present current sofware stack and how it interacts together
** Present current sofware stack and how it interacts together
** List and identify the weaknesses (at least the one not clearly identify already) in the current architecture/software  
** List and identify the weaknesses (at least the one not clearly identify already) in the current architecture/software  
Line 24: Line 24:
**Go over the crawler's CLI params to understand how/when to use them (<code>docker run --rm -it ghcr.io/openzim/zimit:dev crawl --help</code>)
**Go over the crawler's CLI params to understand how/when to use them (<code>docker run --rm -it ghcr.io/openzim/zimit:dev crawl --help</code>)
**<s>Status of <bdi>[https://github.com/webrecorder/browsertrix-crawler/issues/207 Success status code on failure]</bdi></s>
**<s>Status of <bdi>[https://github.com/webrecorder/browsertrix-crawler/issues/207 Success status code on failure]</bdi></s>
**Status of [https://github.com/webrecorder/browsertrix-crawler/issues/246 Disable browser updates]
**<s>Status of [https://github.com/webrecorder/browsertrix-crawler/issues/246 Disable browser updates]</s> Fixed in Zimit, but not yet upstream in Browsertrix
**Status of [https://github.com/webrecorder/browsertrix-crawler/issues/159 SSLError]
**<s>Status of [https://github.com/webrecorder/browsertrix-crawler/issues/159 SSLError]</s>
**[https://github.com/openzim/warc2zim/issues/109 First access to warc2zim file doesn't correctly catch external links]
**[https://github.com/openzim/warc2zim/issues/109 First access to warc2zim file doesn't correctly catch external links]


Line 32: Line 32:
** [https://github.com/openzim/warc2zim/issues/65 How communicate to a user the boundaries of a ZIM?]
** [https://github.com/openzim/warc2zim/issues/65 How communicate to a user the boundaries of a ZIM?]
** [https://github.com/openzim/zimit/issues/126 Should we still use Service workers?]
** [https://github.com/openzim/zimit/issues/126 Should we still use Service workers?]
** [https://github.com/openzim/warc2zim/issues/72 What kind of size optimisation should we run?]
** <s>[https://github.com/openzim/warc2zim/issues/72 What kind of size optimisation should we run?]</s> WONTFIX
** [https://github.com/openzim/warc2zim/issues/104 Assess pseudo namespaces]
** [https://github.com/openzim/warc2zim/issues/104 Assess pseudo namespaces]
**<bdi>[https://github.com/openzim/zimit/issues/166 Should we accept invalid HTTPs?]</bdi>
**<bdi>[https://github.com/openzim/zimit/issues/166 Should we accept invalid HTTPs?]</bdi>

Navigation menu