Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

I have a question about how the Web archive works: can Zed ask them to turn down the archived copy and if he does will they remove it?

Does anybody know if Zed released his early blog posts under Creative Commons? Because if he did anybody can republish his old rant today.



Oh, I just noticed at the bottom of the archived page: 'Licensed under a Creative Commons Attribution License.'

The new Zed's blog doesn't have a CC reference, BTW


You really shouldn't CC-license things blindly. They are useful, but not appropriate in all circumstances. I see no reason to CC-license your blog posts. (Copyright gives you no control over ideas and fair use means people can quote you in most circumstances quotes are useful, so you're not really giving anything useful away anyhow.)


I think CC licensing on blogs (or email) is a form of conspicuous production, a way of signaling that you're part of the CC movement.


The web archive will retroactively remove anything they've archived from your domain if you disallow "ia_archiver" in your robots.txt file (and obviously, will stop crawling it).

http://www.archive.org/about/exclude.php


just to clarify because i've had to deal with this before, excluding ia_archiver from your robots.txt will prevent archive.org from indexing new versions and deny access to your old versions, but it will not make them delete any archived material from your site that already exists.

when you visit an archive.org page, their robot will fetch that site's robots.txt file in real-time (with some basic caching mechanism, i'd assume) and check for an exclusion.

if your site ever goes down after the point you block their robot, or you let your domain name expire, or some squatter buys your domain after you're done with it and doesn't put up a robots.txt file, or anything else that would prevent their site from reading your robots.txt at the time someone tries to view the archive, they will allow full access to your site archives.


He does state in the post that he is the copyright holder and won't give permission to repost it. That doesn't mean he hasn't already given express permission via a CC license previously, but, assuming he didn't...




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: