-
Website
http://ajaydsouza.com -
Original page
http://ajaydsouza.com/archives/2006/08/25/aboutusorg-is-it-ethical/ -
Subscribe
All Comments -
Community
-
Top Commenters
-
Gregg Gordon
1 comment · 1 points
-
Thilak Rao
1 comment · 13 points
-
AndyBeard
1 comment · 4 points
-
benoythapa
3 comments · 1 points
-
Rob
1 comment · 1 points
-
-
Popular Threads
the best part is the notice on the editting page warning you not to contribute copyrighted information.
I stumbled upon it by chance. Was a bit happy at first but then realized the implications.
@adam, I too was quite amused by that. Talk about no copyright infringement, and the site does just that!
Two strange things about the listing - it capitalised the P in nexistepas, which I have never done, anywhere.
But the oddest thing was the list of sites that allegedly link to my domain. Out of the eight they listed (considerably fewer than actually do link to my blog), I'd never heard of three of them, and when I checked those out, they had no such link. Eh well.
And yes, the dire warning against posting copyright info is pretty funny, considering.
As to the ethics of the site, I'm not so sure I can get too excited about it: WHOIS data is in the public domain, is it not? If someone wants to root out my identity, it can be done more effectively than by visiting AboutUs...
I think it's okay for a search engine or sites like Technorati to display parts of my content because other user cannot edit it. But on a wiki there is no easy way to tell whether the information is correct or not (at least for an average user who doesn't fully understand the concept - and problems - of a wiki).
I'm not sure where its pulling that information from except maybe an old whois cache, but thats been private since almost a year ago. Then again, the screenshot was also at least 9+ months old.
I didn't need to log in to delete the information. I just hit edit and removed the full contents of the article. I'll have to keep an eye on it to see if its reverted or re-added.
I do feel a bit more vulnerable now that I saw that though... almost like those people that found out AOL released their search logs.
:S
I appreciate the feedback from you and your community. The intention of AboutUs is to be a valuable and free resource that allows users to share their thoughts and knowledge of various websites. That said, your concerns are well expressed and here are my comments:
Whois data is readily and publicly available from many sources, as is historical whois data, which often pre-dates a domain being switched to "private". I am not aware of any whois data service that first asks the registrant for permission to provide it -- not that this wouldn't be good, but it seems impractical. We list contact information because it is valuable to users wanting to contact website owners. The information can be changed simply by hitting the edit button.
Search engines also scrape some content from sites in order to give their readers a sense of what is on the site so the reader can decide if it's worth going to the site. AboutUs does the same thing. The goal is to present some information as a "stub" so that readers to easily recognize the site. It can be edited from there.
On capitalization, we algorithmically try to figure out where the word breaks are and unfortunately it is not always accurate. So my apologies for that and thanks for helping to correct some of these errors.
AboutUs is a wiki, and as such, its content is openly editable. Some site owners may feel the desire to check AboutUs to see if anyone else has modified the page referring to their site or commented positively or negatively -- just as they might want to know if someone made a blog or del.icio.us entry about their site. Watching for commentary on one's own site is par for the course if you publish a website in the first place. To make this easier however, it is our plan to add the ability to get an e-mail alert for any pages users put on their watch list. This will be coming as soon as we are able.
I am a big believer in wikis and their power to create collaborative works which would be impossible to do with fewer people -- yet they do pose some new problems to think about. We will be watching the changes made on our site carefully to keep the development of the AboutUs moving in a positive direction.
I will also do my best to respond to continued thoughts. Thanks - Ray.
Really appreciated. I guess AboutUs does have a long way to go and a lot of queries to be answered as it gets there.
Hilarious that when I went in to edit the pages about my site, there's a big message about not posting copyrighted content.
I'm just horrified that this site even exists.
On the front page, I typed my domain name into the "add missing domain" box, and told it to create the page. It took a second and did so, and then let me have a look see.
The resulting page is rather bereft of information. It has a screen shot and some unusual text for a description, but it's all more or less correct. Funnily enough, I recognize where all this information came from.
Near as I can figure, it basically scraped Alexa and possibly Technorati. The screen shot of my page came from somewhere I do not know. Alexa's screen shot of my page is different, and the screen shot it found is out of date, showing some older material. But the description is straight from Alexa, as is the email address and name and such.
Looking at my server logs, it doesn't appear to have actually hit my site at all, although I need to check the apache logs directly to be certain of that.
what is your comment on the problem summarized (again, not the first time this came up) by Lisa regarding copying our content, leaving copyright notices out and in some kind of wired irony saying on the edit page not to use copyrighted content?
You said search engines also use the content, sure, but they don't enable anybody to change this content!
Daniel
AboutUs should be "opt in" (in the same way that many sites like Technorati won't index your site unless you "ping" them) and should not editable by all and sundry. Plus it needs to be possible to purge your history, so that copyrighted material is actually removed.
Scenario: John B. Average becomes miffed with Jane B. Average (his competition) and decides to post all of her private information on the site. Jane B. Average has no idea this site exists, but starts wondering why strange people are e-mailing her, calling her, or even driving by her home. See where I'm going with this?
As is stands now, I don't believe AboutUs.org is unethical, but I don't care for it.
It's not a registrar or whois look-up service; the owner should give website owners the ability to opt-out of appearing in the list. Google, Yahoo!, MSN, and even The Internet Archives give webmasters a way to opt-out of their directories, and so should AboutUs.org.
My 2 cents.
~ Teli
It's sad I had to create the following robots.txt for just this bot.
User-agent: AboutUsBotDisallow: /
Like others, I checked my own entry and was disgusted that such info was publicly available.
I emailed Ray and surprisingly got a reply within minutes stating that he's drafting a new policy 'within the day'.
The real danger I think some have overlooked is that the Wiki's search function allows the entry of an individuals NAME and then lists info relating to the domains they own. Pretty dangerous, i would have thought, when it comes to some people (such as battered wives who may be blogging about stuff they'd rather a husband didn't know, for example).
I'm absolutely appalled by this site.
Truly.
Thanks for alerting us all to it's existence. I've also blogged about it, albeit less eloquently than yourself, advising my readers to check and, if required, edit any information already there.
Got your pingback from your post. See it below :)
1) There is now a way to get rid if the history and recreate a shorter article that doesn't contain descriptive content from the site or contact information. The shorter article will prevent the page from being recreated by the bot if someone tries to add it again. Unfortunately, I don't yet have an automated way to authenticate domain owners, so I ask that anyone who wants this done send me an e-mail (ray@aboutus.com) from the whois address listed (or send some other evidence of domain ownership) and I'll take care of them manually for now.
2) Also, the bot will now take much less descriptive information from the sites it looks at and label that text as "Excerpted from the website description" and point back to the site for more information. For examples of this, look at the "Sites recently added" link from the main page.
3) The bot now also respects robots.txt and there is an article on the site that describes several ways to make that happen. There will be examples of that in "Sites recently added" also.
I know this doesn't answer all of the questions posed here, but I will continue to listen and adjust as best I can. - Ray.
It is great to see all the efforts you are putting in to address our issues.
I actually like the idea. I see the problems it can cause but I do feel the best intentions are meant. I also think that this could happen to anyone of us at anytime. I see postings fairly regularly about ideas and content being 'nicked'.
But I think most of all I like the idea of bringing the sites I manage under one roof so to speak in this sort of concept.
Yes I have doubts but I wish Ray all the best with the site idea and with sorting out some of the issues people might have.
Thanks for listening.
Rich
Maybe you totally missed reading the copyright statement on my website:
http://www.rare-cancer.org/copyright.html
What you have done is illegal. It's called copyright infringement. And your wiki is of no use other than your own private purposes. I believe in wikis, such as wikipedia, but they do not scrape material off the web with a robot. And they do not use the site to sell advertising.
You have a number of my sites on the AboutUs website. Remove them. I have no problem getting lawyers to work for me, pro bono, because of the international rare cancer agency I run.
I would suggest that you hire a fleet of lawyers to defend yourself ungainst copyright infringement. You'll need them!
Sharon
I think it best to write to Ray directly from the AboutUS.org site. I don't think he will be reading the comments out here, especially after so many months.
You can also edit your site information in the Wiki directly.
Thanks for allowing me to add my 2 cents worth, also check you searchs in google to see if you have been victimized by this
Trabob
I can remove information, but I can't remove my listing. Anyone at any time can post erroneous, flaming or spammy information on my listing, therefore I must be constantly vigilant and participate in this site.
I have no problem with the idea of a wiki for websites, but it should be opt in. I don't want to wake up one day and find my family's address and phone number listed on some wiki! And aboutus should give us a way to really opt out, beyond merely removing information under our listing. We should be able to remove our own listing altogether.
Does anyone know if there is any to take action against Ray? So far my inquiries and comments to Ray have been answered by yes-men who assure me in a condescending tone that I 'just don't yet see the value of the site'. Oh, I see the value of the site alright. I see just how valuable it is for ole Ray to exploit all the hard work we put into our sites.
I posted a number of ideas on how to fight back at
http://www.stopaboutus.org/discuss/viewforum.ph...
so I wont repost them here, but would like to explore what we can do. If there is no laws against it maybe we need to talk to the people that make the laws, as this in no way seems right to me
Thanks
Trabob
This business of 'scraping' (aka STEALING) other people's web content is really a low-down scurvy way to try to capitalize on other people's creativity because the scrapers lack any creativity themselves.
I suggest everyone who is outraged by this do the following:
1. File complaint with Attorney General for your state and for California or wherever those aboutus.org jerks are.
2. POST a BIG NOTICE on the aboutus.org site indicating that the ABOUTUS.ORG PEOPLE HAVE VIOLATED YOUR COPY RIGHTS & PRIVACY BY LIFTING YOUR WEB CONTENTS AND WHOIS INFO AND PUBLICLY PUBLISHING THEM, WHICH IS ILLEGAL.
3. FILE a legal complaint and hopefully Class Action Lawsuits will soon be gathering on the horizons to put a stop to this BS!
Good luck.
PS The jerks at Alexa and that so-called Biblo thing in Egypt are even worse, because they cough up OLD OLD CONTENT that you have deleted from your web projects and they still make it accessible --which could cause serious privacy problems and even lead to dangerous situations for folks who are being stalked and trying to fly under the radar.
Be positive about this free exposure!
While agreeing with all of what you say above, I don't consider it much different to other search engines which scrape your content and keep a copy for themselves.
So long as they obey the robots.txt standard, I don't see the problem.
If you don't want something crawled just don't publish it, use an intranet or implement an authentication system.