Announcing the Open Content Alliance

  • Posted October 2nd, 2005 at 9:02 pm by Yahoo! Search
  • Categories: Guest Bloggers

From time to time we’ve invited guest bloggers to write on the Yahoo! Search blog. Today we welcome Brewster Kahle, founder of the Internet Archive. We asked Brewster if he’d like to introduce the Open Content Alliance.


Is Open Content the next step in the traditions of Open Source and an Open Network? Many people seem to think so (and wouldn't it be great?). Working with libraries, government institutions, archives, technology companies, web companies-- and we all are saying the same thing-- it is time to have more great material available on the Internet and to be able to have it be open and free.

The opportunity before all of us is living up to the dream of the Library of Alexandria and then taking it a step further-- Universal access to all knowledge. Interestingly, it is now technically doable. Then the question became-- is it in the interest of enough people and institutions to get there? Some hang-ups have been around costs, rights, and guidelines for sharing. All of these things were worked out for their domains by Internet folks and open source folks in the last few decades. But how are we going build a system that has everything available to everyone?

I am jazzed to say that a group of organizations is starting an Open Content Alliance to try out answers by joining new and existing collections. We are looking for more contributors and helpers. We are starting with a set of principles.

To kick this off, Internet Archive will host the material and sometimes helps with digitization, Yahoo will index the content and is also funding the digitization of an initial corpus of American literature collection that the University of California system is selecting, Adobe and HP are helping with the processing software, University of Toronto and O’Reilly are adding books, Prelinger Archives and the National Archives of the UK are adding movies, etc. We hope to add more institutions and fine tune the principles of working together.

Initial digitized material will be available by the end of the year.

So the costs are mostly being borne by the host institutions based on their own fundraising or business models. The cost of digitization is sometimes offset by a different party (in the case of American Lit– Yahoo!). We think this can scale to millions of books movies and audio recordings.

Yahoo! has been great to work with on this because they get it, and have substantial abilities to cause things to happen. I find it interesting with how enduring a company’s culture is. Jerry Yang and David Filo’s personalities are still quite evident in the company today.

The rights issues come in many flavors, but our guiding principle is to offer high-resolution, downloadable, reusable files of the public domain. When we are dealing with in-copyright materials, the Internet Archive has been leveraging the creative commons licenses to great effect. In-copyright issues remain, but at least we can get substantial work going on the public domain.

We believe that donors should have the option to restrict the bulk re-hosting of a substantial part of a collection. This seems fair and is similar to the Creative Commons Sampling license. Interestingly University of California and Yahoo have decided to not put any restrictions. So if another library wants to re-host these on their website, or another search engine wants to integrate them into their page flipping system, they are welcome to. This is so great’let’s let the public domain stay public and build business models on in-print materials.

To be clear, the public domain works in the Open Content Alliance can be “borrowed” in bulk for build navigation services, do research on, and the like. Bits and pieces of the public domain collections can be re-used and re-interpreted. If someone wants to print and binding a book and sell it on Amazon.com– go nuts, if they want to make it into an audio book and post it on the web– go for it (we will even supply the hosting for this), basically let’s have a blast building on the classics of humankind.

On October 25th we will be demonstrating some of the new bookscanning and partner technologies.

If anyone is interesting in helping with this, please contact us at oca at archive dot org.

Thank you!

-brewster
Founder, Digital Librarian Internet Archive

  • 13 Comments
  • Subscribe

RSS feed

13 Comments

Comment by Bruce Albrecht
2005-10-03 08:26:34

Will the OCA have a process to identify works that are in the public domain because they were originally published after 1922 but the copyright was not renewed? Will these works be digitized and made available by the OCA?

 
Comment by spytrdr
2005-10-03 13:25:05

Congratulations. I don”t like the domain you chose for this project (“opencontentalliance.org”), though.
Since we are trying to build a “universal digital library” of the world’s knowledge, why not go for:
unilibrary.com/net/org ?
Regards,
S

 
2005-10-03 13:41:08

;-) Ideally
it would be wonderful if ALL the major Search Engines could just get together …ONCE -
forget about competition -
and Collectively FUND and Participate in this project.

Imagine what additonal benefits Microsoft, AOL, DMOZ and Google could also add COLLECTIVELY!.

Just hope politics and egos will not hobble this information revolution effort.

 
Comment by dave gottschalk
2005-10-03 19:37:20

Can you tell me any more information on the hardware and software that you will use to scan in these thousands of books? Will you harm the books by opening them wide open and laying them on a scanner? How fast can you scan a 100 page book? 400 pages? How large will the files be that hold a 100 page book? Does your scanner have a automatic feeder for the book, or does it have to be manually held and each page turned by hand?

Thanks for your response.

Dave Gottschalk
currently in Nagoya, Japan

 
Comment by Surendra raju
2005-10-03 19:47:33

Really, we were waiting for such contents free on WEB. Sincere thanks to to you all and all those PUBLISHERS, LIBRARIES who are providing their books to you.
wish all ther success and fame.

 
Comment by Bill
2005-10-04 13:26:07

Copyright Clearance Center, a not for profit organization, has web based solutions that allow permission and licensing at the point of content.
This allows honest people an easy avenue for content licensing and re-use.
The solution is called Rightslink.

 
Comment by Chris Eveleigh
2005-11-01 06:41:17

The phrase “Universal access” gets me thinking about the 6 out of 7 people in the world who don’t have access to the Internet, or whose Internet connection is intermittent, slow, and expensive. Perhaps this collection could be mirrored internationally on the LAN’s of institutions in developing countries, with updates delivered periodically via DVD-ROM, or satellite radio… “Universal access to all knowledge” is an admirable goal, but I think we need to keep in mind that our definition of “Universal” must include those without the technological infrastructure that so many of us (in the 1 in 7) take for granted in our daily lives. (And, yes, I have an agenda–see: http://www.egranary.org for more.)

 
Comment by azriel
2005-11-05 02:46:49

A good idea all out of copyright material should be essentially free.
Paper books are overpriced bulky and frankly belong in the dustbin of history.
I don’t want a physical copy of a book any more than I want a handwritten manuscript on parchment.
Tough luck to those publishers who want to charge exorbitant fees for copying an out of copyright book.
Copyright is to reward originality and nothing more.

 
Comment by Bob
2005-11-10 21:39:21

I have tried to look up a couple of books that should be copyright free but I found that each had comments by reserchers or commentators which made them copyright material. I speak of “Morte de Arthur” and “The Tempest”. Couldn’t a non-copyright copy be found?
Bob

 
Comment by Mr. Jones
2007-02-22 07:13:11

Open Source, Open content…
I don’t understand the buisness model?
Is advertising so cool thing to get so much money from it to feed open source and open content makers?

from:
http://nero.datadiri.com/

 
Comment by John Mathew
2007-02-27 04:45:05

I think it’s good idea

 
Comment by Maria Roger
2007-02-27 04:51:56

There is a very good site related to USMLE preparations.Hope if any one is preparing for it then this site will help in there preparations.

usmle.100free.com

 
Comment by ocnsss
2007-03-28 16:44:14

On personal opinion, I find this very helpful.
Guys, I have also posted some more relevant info further on this, not sure if you find it useful: http://www.bidmaxhost.com/forum/

 

Sorry, the comment form is closed at this time.

back to yahoo! search

subscription options

Facebook Fans

latest posts

archives