Library of Congress Digital Collections – 700 Terabytes of Data and Growing…10.10.09

10 10 2009

Library of Congress

Here is an excerpt from the Voice of America report “US Library of Congress’ Digital Collection One of World’s Largest”:

“…So far, the library has a total of 700 terabytes of data. But because of copyright issues, only 200 of those are available on the Web.

“A terabyte is about 1,600 CDs or about 330 hours of TV or about 2,000 books and we have about 500 terabytes that we keep in our long term preservation systems,” she adds.

At the Library of Congress, the numbers can be mind-boggling. Experts estimate they have more than 120 million books, 36,000 feature films, hundreds of thousands of music sheets and recordings, and the large collections of manuscripts, Web sites, posters and photography. Yet only one percent of it has been digitized.

Thomas Youkel is the senior systems engineer. ‘We have a scan lab here that scans anywhere from four to six million items a year,’ … While workers continue scanning and digitizing millions of items, they keep an eye on a migration plan, to move from obsolete technology to new technology – a never ending process.”
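To put the quoted figures in perspective, here is a rough back-of-the-envelope sketch that applies the per-terabyte equivalents from the excerpt (about 1,600 CDs, 330 hours of TV, or 2,000 books) to the three sizes mentioned: 700, 200 and 500 terabytes. The function name `equivalents` and the constant names are my own, and the conversion rates are the report's approximations, not exact values.

```python
# Per-terabyte equivalents as quoted in the VOA excerpt (approximate).
CDS_PER_TB = 1_600
TV_HOURS_PER_TB = 330
BOOKS_PER_TB = 2_000


def equivalents(terabytes):
    """Convert a size in terabytes into the rough equivalents quoted above.

    `equivalents` is a hypothetical helper written for this post.
    """
    return {
        "cds": terabytes * CDS_PER_TB,
        "tv_hours": terabytes * TV_HOURS_PER_TB,
        "books": terabytes * BOOKS_PER_TB,
    }


# The three sizes mentioned in the excerpt, in terabytes.
sizes = [
    ("total collection", 700),
    ("available on the Web", 200),
    ("long term preservation", 500),
]

for label, tb in sizes:
    eq = equivalents(tb)
    print(
        f"{label}: {tb} TB is roughly {eq['cds']:,} CDs, "
        f"{eq['tv_hours']:,} hours of TV, or {eq['books']:,} books"
    )
```

By these rates, the full 700 terabytes works out to something like 1.1 million CDs.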


