archive-team – Data Horde https://datahorde.org Join the Horde! Sun, 18 Jul 2021 18:36:17 +0000 en-US hourly 1 https://wordpress.org/?v=6.4.3 https://datahorde.org/wp-content/uploads/2020/04/cropped-DataHorde_Logo_small-32x32.png archive-team – Data Horde https://datahorde.org 32 32 Help Archive Team archive older unlisted YouTube videos! https://datahorde.org/help-archive-team-archive-older-unlisted-youtube-videos/ https://datahorde.org/help-archive-team-archive-older-unlisted-youtube-videos/#comments Sat, 17 Jul 2021 06:47:57 +0000 https://datahorde.org/?p=2530 With less than 5 days left until YouTube will make most unlisted videos uploaded before 2017 private, time is running out before these videos are lost forever!

Fortunately, Archive Team has started a project to back up the metadata and 360p resolution video files for as many of these items as possible, and contributing is really easy! In addition to the videos themselves, data to be archived by this project includes the video watch page (including titles, descriptions, uploader channel, etc.), captions, comments, attributions, and thumbnails. The data archived by this project will be made available in WARC format on the Internet Archive and through the Internet Archive Wayback Machine.

To help out with this project, simply follow the steps to download and run an Archive Team Warrior, and then select the YouTube project. (You can also run the project using a Docker container using atdr.meo.ws/archiveteam/youtube-grab as the image address.)

Additionally, people with lists of unlisted video IDs/URLs and unlisted playlist IDs/URLs are encouraged to share them so they can be archived.

In order to stay up-to-date with the project and be reachable in case of an issue, project contributors are encouraged to connect and stay connected to the project discussion channel, #down-the-tube on irc.hackint.org, also available through webchat.

Archiving progress statistics for this project are available on the Archive Team project tracker, and source code is available on GitHub.

After older unlisted videos are made private on July 23, this project will shift to archiving the metadata for as many YouTube videos as possible, though not the actual video files themselves in most cases due to the amount of storage video takes and limited resources of the Internet Archive.

]]>
https://datahorde.org/help-archive-team-archive-older-unlisted-youtube-videos/feed/ 1
Help Archive Team Save Yahoo! Answers! https://datahorde.org/help-archive-team-save-yahoo-answers/ https://datahorde.org/help-archive-team-save-yahoo-answers/#comments Thu, 22 Apr 2021 02:35:47 +0000 https://datahorde.org/?p=2207 Yahoo! Answers is shutting down on May 4th, 2021, taking nearly 15 years worth of content with it!

Archive Team is trying to save as much of it as possible, and you can help!

By setting up the Archive Team Warrior and letting it run in the background, you can back up questions and answers from Yahoo! Answers and make them available in the Internet Archive Wayback Machine. The Archive Team Warrior is easy to set up and uses very few of your system resources. The Archive Team Warrior can work on up to 6 items concurrently.

Advanced users can also run the project with Docker using the atdr.meo.ws/archiveteam/yahooanswers-grab Docker image, which can easily be deployed on large networks and allows for running projects at a higher concurrency rate per container (maximum 20 concurrent items, though users running the project with this many concurrent items might be rate-limited by Yahoo!).

If you need any help or have any questions about the project, please feel free to refer to the project page on the Archive Team Wiki or ask in Archive Team’s IRC channel for the Yahoo! Answers project. (Please be patient and stay connected if your question isn’t immediately answered so you don’t miss any responses.)

]]>
https://datahorde.org/help-archive-team-save-yahoo-answers/feed/ 1