Difference between revisions of "Athens 2023"

From Kiwix
Jump to navigation Jump to search
(3 intermediate revisions by the same user not shown)
Line 18: Line 18:
We need to (does not have to be in this order):
We need to (does not have to be in this order):
* Assess current situation
* Assess current situation
** Present Webrecorder/Kiwix current activities and projects
** <s>Present Webrecorder/Kiwix current activities and projects</s> Ilya not available
** Present current sofware stack and how it interacts together
** Present current sofware stack and how it interacts together
** List and identify the weaknesses (at least the one not clearly identify already) in the current architecture/software  
** List and identify the weaknesses (at least the one not clearly identify already) in the current architecture/software  
Line 24: Line 24:
**Go over the crawler's CLI params to understand how/when to use them (<code>docker run --rm -it ghcr.io/openzim/zimit:dev crawl --help</code>)
**Go over the crawler's CLI params to understand how/when to use them (<code>docker run --rm -it ghcr.io/openzim/zimit:dev crawl --help</code>)
**<s>Status of <bdi>[https://github.com/webrecorder/browsertrix-crawler/issues/207 Success status code on failure]</bdi></s>
**<s>Status of <bdi>[https://github.com/webrecorder/browsertrix-crawler/issues/207 Success status code on failure]</bdi></s>
**Status of [https://github.com/webrecorder/browsertrix-crawler/issues/246 Disable browser updates]
**<s>Status of [https://github.com/webrecorder/browsertrix-crawler/issues/246 Disable browser updates]</s> Fixed in Zimit, but not yet upstream in Browsertrix
**Status of [https://github.com/webrecorder/browsertrix-crawler/issues/159 SSLError]
**<s>Status of [https://github.com/webrecorder/browsertrix-crawler/issues/159 SSLError]</s>
**[https://github.com/openzim/warc2zim/issues/109 First access to warc2zim file doesn't correctly catch external links]
**[https://github.com/openzim/warc2zim/issues/109 First access to warc2zim file doesn't correctly catch external links]


Line 32: Line 32:
** [https://github.com/openzim/warc2zim/issues/65 How communicate to a user the boundaries of a ZIM?]
** [https://github.com/openzim/warc2zim/issues/65 How communicate to a user the boundaries of a ZIM?]
** [https://github.com/openzim/zimit/issues/126 Should we still use Service workers?]
** [https://github.com/openzim/zimit/issues/126 Should we still use Service workers?]
** [https://github.com/openzim/warc2zim/issues/72 What kind of size optimisation should we run?]
** <s>[https://github.com/openzim/warc2zim/issues/72 What kind of size optimisation should we run?]</s> WONTFIX
** [https://github.com/openzim/warc2zim/issues/104 Assess pseudo namespaces]
** [https://github.com/openzim/warc2zim/issues/104 Assess pseudo namespaces]
**<bdi>[https://github.com/openzim/zimit/issues/166 Should we accept invalid HTTPs?]</bdi>
**<bdi>[https://github.com/openzim/zimit/issues/166 Should we accept invalid HTTPs?]</bdi>

Revision as of 16:48, 22 May 2023

This page summarizes the plans for the Kiwix Hackathon 2023 in Athens (to not be confused with Hackathon 2023 Paris.

Date & Venue

From Thursday 18 May (evening) to Friday 26 May (morning) in Athens (we have rent a flat).

Logistics

DO NOT FORGET TO BRING AN EXTENSION CORD (and an adapter if you are not joining from mainland Europe).

FYI Greece uses the same C, E and F sockets as the rest of Europe.

Goals

The main goal of the hackahton is to focus on Zimit and all its software stack: Browsertrix, warc2zim, python-libzim, ...

We want to prepare next big iteration on Zimit, considering that current version is the result of of the first iteration of 2020-21.

We need to (does not have to be in this order):

Achievements

Agenda

From Friday to Sunday there is the Wikimedia Hackathon for which at least Matthieu and Kelson has registered.

After that we will be all gathered to focus on Zimit.

Attendees

Kiwix
  • Reg (remote)
  • Kelson
  • MGauthier
  • Jaifroid
Webrecorder
  • Ilya (maybe)