About 50 results
Open links in new tab
  1. WebAssembly (wasm) files corrupted and incorrectly treated as text

    Jul 8, 2019 · For a .wasm file archived on internet archive part of a full web page that was archived. The source for the original hosted site can be found on GitHub. It looks like the application/wasm Mime …

  2. Heritrix is the Internet Archive's open-source, extensible ... - GitHub

    Heritrix is designed to respect the robots.txt exclusion directives and META nofollow tags. Please consider the load your crawl will place on seed sites and set politeness policies accordingly. Also, …

  3. Build failing via maven-assembly-plugin: group id is too big

    Dec 3, 2021 · I am getting a build failure on RHEL 7.9 and Ubuntu 18.04 that looks to be introduced with the upgrade of maven-assembly-plugin (#414) regarding the group id being too big.

  4. Real-time in-browser translation for all books #9594 - GitHub

    Jul 19, 2024 · Fortuitously, recently Mozilla announced that it had ported Project Bergamot -- a set of neural machine translation tools -- into this web assembly implementation, which could be used to …

  5. Commits · internetarchive/heritrix3 · GitHub

    Sep 30, 2023 · Commits on Dec 3, 2021 Fix files included by assembly ldko committed Dec 3, 2021 Copy the full SHA a0f0ba4 View commit details Browse the repository at this point in the history Fix …

  6. heritrix3/dist/src/main/bin/heritrix.cmd at master - GitHub

    Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project. - internetarchive/heritrix3

  7. GitHub - internetarchive/emularity-config: archive.org software …

    archive.org software emulation. Contribute to internetarchive/emularity-config development by creating an account on GitHub.

  8. emularity-engine/README.md at main - GitHub

    The files are grouped and split up into 3 GitHub repositories (linked below) and corresponding deploys. Internet Archive is using docker OCI containers to serve the repo files as static files, with a lightly …

  9. heritrix3/CHANGELOG.md at master - GitHub

    Jul 27, 2022 · Build failing via maven-assembly-plugin: group id is too big #447 Do not require DNS when using a web proxy #211 Merged pull requests: Bump jsch from 0.1.52 to 0.1.54 in /commons …

  10. Provision ol-web0 to load balance · Issue #9001 - GitHub

    Provision ol-web0 to load balance #9001 mekarpeles opened this issue Apr 1, 2024 · 0 comments · Fixed by #9107 Assignees Labels Affects: Operations Affects the IA DevOps folks Lead: …