
WebAssembly (wasm) files corrupted and incorrectly treated as text
Jul 8, 2019 · For a .wasm file archived on the Internet Archive as part of a full web page that was archived. The source for the original hosted site can be found on GitHub. It looks like the application/wasm MIME …
Heritrix is the Internet Archive's open-source, extensible ... - GitHub
Heritrix is designed to respect the robots.txt exclusion directives and META nofollow tags. Please consider the load your crawl will place on seed sites and set politeness policies accordingly. Also, …
Build failing via maven-assembly-plugin: group id is too big
Dec 3, 2021 · I am getting a build failure on RHEL 7.9 and Ubuntu 18.04 that looks to be introduced with the upgrade of maven-assembly-plugin (#414) regarding the group id being too big.
Real-time in-browser translation for all books #9594 - GitHub
Jul 19, 2024 · Fortuitously, recently Mozilla announced that it had ported Project Bergamot -- a set of neural machine translation tools -- into this web assembly implementation, which could be used to …
Commits · internetarchive/heritrix3 · GitHub
Sep 30, 2023 · Commits on Dec 3, 2021: Fix files included by assembly (ldko committed Dec 3, 2021, a0f0ba4). Fix …
heritrix3/dist/src/main/bin/heritrix.cmd at master - GitHub
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. - internetarchive/heritrix3
GitHub - internetarchive/emularity-config: archive.org software …
archive.org software emulation. Contribute to internetarchive/emularity-config development by creating an account on GitHub.
emularity-engine/README.md at main - GitHub
The files are grouped and split up into 3 GitHub repositories (linked below) and corresponding deploys. Internet Archive is using docker OCI containers to serve the repo files as static files, with a lightly …
heritrix3/CHANGELOG.md at master - GitHub
Jul 27, 2022 · Build failing via maven-assembly-plugin: group id is too big (#447). Do not require DNS when using a web proxy (#211). Merged pull requests: Bump jsch from 0.1.52 to 0.1.54 in /commons …
Provision ol-web0 to load balance · Issue #9001 - GitHub
mekarpeles opened this issue Apr 1, 2024 · 0 comments · Fixed by #9107 · Labels: Affects: Operations (affects the IA DevOps folks), Lead: …