Comments for Data Horde https://datahorde.org Join the Horde! Thu, 13 Jun 2024 15:22:32 +0000 hourly 1 https://wordpress.org/?v=6.6 Comment on How to Archive or Scrape MediaFire Files using mf-dl by Daylan Allen https://datahorde.org/how-to-archive-or-scrape-mediafire-files-using-mf-dl/#comment-24319 Thu, 13 Jun 2024 15:22:32 +0000 https://datahorde.org/?p=2347#comment-24319 Hey there! I stumbled across this tool today and realized the download_link_prefix was having issues again, a little help from our new friend GPT and I have a working version now.

Make sure to install Beautifulsoup4 – `pip install beautifulsoup4`

In mfdl.py – Find the ‘find_direct_url’ function and replace it with this:

def find_direct_url(info_url):
rq = requests.get(info_url, headers=HTTP_HEADERS, timeout=TIMEOUT_T)
web_html = rq.text
soup = BeautifulSoup(web_html, ‘html.parser’)

# Find the download link
download_link = soup.find(‘a’, {‘class’: ‘input popsok’, ‘aria-label’: ‘Download file’})
if download_link is None:
return {“success”: 0}

direct_url = download_link[‘href’]

# Find the uploaded location
uploaded_from_tag = soup.find(‘p’, text=re.compile(‘This file was uploaded from ‘))
if uploaded_from_tag is None:
return {“url”: direct_url, “location”: “Unknown”, “success”: 1}

uploaded_from = uploaded_from_tag.text
location = uploaded_from.split(” on “)[0].replace(“This file was uploaded from “, “”)

return {“url”: direct_url, “location”: location, “success”: 1}

This uses BS4 to parse the webpage and look for the href download link we want, avoiding the manual way of just searching for the exact download_link_prefix. I would make a commit or merge, but it seems the 2 maintainers aren’t active on there anymore. Hope this helps anyone using the tool! Everything else in the tutorial (at least for downloading folders) should work fine.

]]>
Comment on How to recover your Yahoo! Groups from the Internet Archive by Simon https://datahorde.org/how-to-recover-your-yahoo-groups-from-the-internet-archive/#comment-24296 Wed, 12 Jun 2024 04:42:40 +0000 https://datahorde.org/?p=2293#comment-24296 In reply to themadprogramer.

Hi themadprogramer,

Thank you so much for this post. Quick query: If I get to the part of your post where you say:

“Raw data collections are stored in assorted, non-alphabetic, batches. To see if a group has its raw data available on the Internet Archive, simply query:”

… and no result appears ~ is there a workaround, or someone to contact to see when the OPEN group I have found (using the previous parts of your blog) posts might be uploaded again?

Thank you so much.
Simon

]]>
Comment on Yahoo! Groups Archive Metadata Now Available by Rhio2k https://datahorde.org/yahoo-groups-archive-metadata-now-available/#comment-23366 Thu, 04 Apr 2024 01:53:57 +0000 https://datahorde.org/?p=1849#comment-23366 What good is metadata? Have the groups THEMSELVES been saved? Can you access the conversations and files? Those of us who were members already KNOW how many members our groups had. We know our groups existed. We want to go back and re-read old conversations with people we no longer are in contact with.

]]>
Comment on How to Archive or Scrape MediaFire Files using mf-dl by HI https://datahorde.org/how-to-archive-or-scrape-mediafire-files-using-mf-dl/#comment-22501 Sat, 10 Feb 2024 06:16:46 +0000 https://datahorde.org/?p=2347#comment-22501 In reply to Nath.

Worked perfectly, thanks!

]]>
Comment on How to recover your Yahoo! Groups from the Internet Archive by caro https://datahorde.org/how-to-recover-your-yahoo-groups-from-the-internet-archive/#comment-21944 Sat, 13 Jan 2024 22:07:06 +0000 https://datahorde.org/?p=2293#comment-21944 In reply to Victoria kightlinger.

Awesome.

]]>
Comment on How to recover your Yahoo! Groups from the Internet Archive by Anonymous https://datahorde.org/how-to-recover-your-yahoo-groups-from-the-internet-archive/#comment-21943 Sat, 13 Jan 2024 22:05:52 +0000 https://datahorde.org/?p=2293#comment-21943 In reply to Victoria kightlinger.

That would be great even if just to read.

]]>
Comment on How to recover your Yahoo! Groups from the Internet Archive by Anonymous https://datahorde.org/how-to-recover-your-yahoo-groups-from-the-internet-archive/#comment-21942 Sat, 13 Jan 2024 22:04:57 +0000 https://datahorde.org/?p=2293#comment-21942 In reply to Victoria kightlinger.

That would be awesome, but so far have not heard anything. Thanks. Would love seeing the yahoo groups return even if we can’t post but can read.

]]>
Comment on Without being exploited: What archivists should learn from the XeNTaX forums aftermath by Anonymous https://datahorde.org/without-being-exploited-what-archivists-should-learn-from-the-xentax-forums-aftermath/#comment-21625 Wed, 27 Dec 2023 04:19:31 +0000 https://datahorde.org/?p=2923#comment-21625 In reply to Guest.

If you read the post on his site (the one that redirects to the discord server), the admin laments that certain egos in the community get on a soapbox and mask their own selfish desires as doing things ‘for the community’. You can see how it is then extraordinarily ironic that this admin shuts down the site and speaks for the ‘community’ about the archiving issue. He is effectively taking the ball and going home, flexing the last remnants of control in some hilarious ego power play. The fallout with Reshax, and anyone else reaching out to him in good faith, about continuing supporting the people he hosted on his forums (read: leeched off of and now apparently represents their interests) all but confirms this. There is no logic to it. But the author of this article is trying to rationalize the decision and present it in the most painfully neutral way to not step on anyone’s toes.

The author isn’t asking the most obvious question: Why the hell should the privilege of “privacy” be afforded to an admin/”community” who built their reputation and site off of decompiling and reverse engineering copyrighted and restricted materials? THAT is the most ironic part about this whole ordeal!

]]>
Comment on Yahoo! Groups Archive Metadata Now Available by Ibrahim mustapha https://datahorde.org/yahoo-groups-archive-metadata-now-available/#comment-21360 Tue, 12 Dec 2023 18:04:34 +0000 https://datahorde.org/?p=1849#comment-21360 Thanks

]]>
Comment on Yahoo! Groups Archive Metadata Now Available by Ibrahim mustapha https://datahorde.org/yahoo-groups-archive-metadata-now-available/#comment-21359 Tue, 12 Dec 2023 18:02:46 +0000 https://datahorde.org/?p=1849#comment-21359 Pls I want to join yahoo member how can I join.

]]>