Replies

From Wikipedia:

Historically, Wayback Machine respected the robots exclusion standard (robots.txt) in determining if a website would be crawled or not; or if already crawled, if its archives would be publicly viewable. Website owners had the option to opt-out of Wayback Machine through the use of robots.txt. It applied robots.txt rules retroactively; if a site blocked the Internet Archive, any previously archived pages from the domain were immediately rendered unavailable as well. In addition the Internet Archive stated, “Sometimes a website owner will contact us directly and ask us to stop crawling or archiving a site. We comply with these requests.”[39] In addition, the website says: “The Internet Archive is not interested in preserving or offering access to Web sites or other Internet documents of persons who do not want their materials in the collection.”[40] [41]

10 posted on 06/03/2018 6:56:08 PM PDT by LostInBayport (When there are more people riding in the cart than there are pulling it, the cart stops moving...)

To: LostInBayport

12 posted on 06/03/2018 7:01:13 PM PDT by BlackAdderess (https://data.bls.gov/timeseries/LNS14000000)

To: LostInBayport

Such cleanup services are called reputation defenders but actually they just whitewash the past and bury the skeletons.

16 posted on 06/03/2018 7:03:16 PM PDT by a fool in paradise (Ads for Chappaquiddick warn of scenes of tobacco use. What about the hazards of drunk driving?)

To: LostInBayport

This is why you use Archive.is in addition to the WayBack Machine. And if it’s really important, archive it on your own hard drive, too.

29 posted on 06/03/2018 7:43:56 PM PDT by FreedomPoster (Islam delenda est)

To: LostInBayport

This is why you use Archive.is in addition to the WayBack Machine. And if it’s really important, archive it on your own hard drive, too.

30 posted on 06/03/2018 7:43:58 PM PDT by FreedomPoster (Islam delenda est)

FreeRepublic, LLC, PO BOX 9771, FRESNO, CA 93794