**JBBdude** @jbbdude@mastodon.bigdinosaur.org · Apr 25, 2017, 05:15

The Internet Archive has decided to ignore robots.txt
https://blog.archive.org/2017/04/17/robots-txt-meant-for-search-engines-dont-work-well-for-web-archives/

This is amazing news for internet history.
a) All public stuff will be crawled. Don't want that? Don't make your shit public.
b) Lapsed domains that get replaced by parking pages with a restrictive robots.txt will no longer hide the old, dead versions of those sites.

Archive everything. #NewLibraryofAlexandria #OCD

**Loam 🌱** @loam@computerfairi.es · Apr 26, 2017, 07:22

@jbbdude point b seems especially good. not sure how i feel about point a, but i understand its reasoning.
