<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>North Carolina Digital Collections Collaboratory &#187; metadata</title>
	<atom:link href="http://digital.lib.ecu.edu/collaboratory/index.php?feed=rss2&#038;cat=3" rel="self" type="application/rss+xml" />
	<link>http://digital.lib.ecu.edu/collaboratory</link>
	<description>Bringing North Carolina Digital Collections Together</description>
	<lastBuildDate>Wed, 26 Jan 2011 20:56:04 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>Sustainability, but at What Price?</title>
		<link>http://digital.lib.ecu.edu/collaboratory/?p=511</link>
		<comments>http://digital.lib.ecu.edu/collaboratory/?p=511#comments</comments>
		<pubDate>Wed, 26 Jan 2011 20:56:04 +0000</pubDate>
		<dc:creator>Gretchen</dc:creator>
				<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[economy]]></category>
		<category><![CDATA[metadata]]></category>

		<guid isPermaLink="false">http://digital.lib.ecu.edu/collaboratory/?p=511</guid>
		<description><![CDATA[I’m just in the midst of reading this ITHAKA S+R report on the sustainability of digital “projects.” It’s one of those reports where the basic ideas are so simple, but you didn’t realize that you knew them until you read it. Among the things that they suggest are vital to sustainability of projects: Dedicated and [...]]]></description>
			<content:encoded><![CDATA[<div class="wp-caption aligncenter" style="width: 436px"><img title="Digital Preservation" src="http://farm5.static.flickr.com/4018/4311841068_207f6ff298.jpg" alt="" width="426" height="500" /><p class="wp-caption-text">from geekandpoke</p></div>
<p>I’m just in the midst of reading <a href="www.ithaka.org/...sustainability/.../SCA_Ithaka_SustainingDigitalResources_Report.pdf">this ITHAKA S+R report on the sustainability of digital “projects.”</a> It’s one of those reports where the basic ideas are so simple, but you didn’t realize that you knew them until you read it. Among the things that they suggest are vital to sustainability of projects:</p>
<ul>
<li>Dedicated and entrepreneurial leadership (this could also be ongoing leadership: how many digitization projects do you know of that fell apart or became static after the project manager left)?</li>
<li>A clear value proposition (as in clear evidence of the value of the project)</li>
<li>Minimizing direct costs (hello appropriate outsourcing!)</li>
<li>Developing diverse revenue sources (of course that’s a dirty word in some parts of the academy)</li>
<li>Clear accountability and metrics for success</li>
</ul>
<p><span id="more-511"></span>The question I am increasingly coming to grips with is: In an era of “Do More with Less” what priority does digitization take in the grand scheme of library work? Especially since a lot of us are still out there at the grass roots level trying to raise support. We know that there is a future for this material, but in some cases our faculty don’t yet fully realize what they can do with it (<a href="http://mkirschenbaum.wordpress.com/2011/01/22/what-is-digital-humanities/">Digital Humanists aside</a>). They may be pleased to be able to replicate as much of their traditional research paradigm online as they can, but it’s hard to argue that we need twice as many resources so we can provide redundant online and in-house services. The small but growing field of Digital Humanities can be effective partners, but they are just one group. Even our own fellow librarians may not be able to grasp how it is more than “putting pictures online”.</p>
<p>In the face of possibly taking another 10% budget cut here that will reduce the staff even further, is digitization the priority? If we do have to cut back, what do we deem the essential activities:</p>
<ul>
<li>Digitization, but less robust metadata</li>
<li>Digitization and metadata but little to no preservation activity</li>
<li>Very limited further development of repository tools (this includes both public and administrative tools)</li>
</ul>
<p>Obviously, a lot of work done by the digital curation communities (<a href="http://ils.unc.edu/digccurr/">DigCCurr</a>, <a href="http://www.dcc.ac.uk/">DCC,</a> and <a href="http://www.icpsr.umich.edu/icpsrweb/ICPSR/curation/index.jsp">ICPSR</a> being among the most prominent) can be a guide, but studies of sustainability seem to reinforce the idea that we can’t cut back, that the only way to ensure that these materials meet their potential is to continue to invest resources into maintaining them, and even to increase those resources into the future. So instead of cutting back as above, should I as project manager begin looking into more diverse and creative ways to find funding outside of my organization’s budget? Is it time to create the “digital swear jar” that Ann and I used to joke about (in which we would put an online donation link in the repository and ask people to donate a dollar every time they swore)?</p>
]]></content:encoded>
			<wfw:commentRss>http://digital.lib.ecu.edu/collaboratory/?feed=rss2&amp;p=511</wfw:commentRss>
		<slash:comments>8</slash:comments>
		</item>
		<item>
		<title>RDAFMODSXMLMARCARRRGGGGHHH!</title>
		<link>http://digital.lib.ecu.edu/collaboratory/?p=503</link>
		<comments>http://digital.lib.ecu.edu/collaboratory/?p=503#comments</comments>
		<pubDate>Fri, 19 Nov 2010 15:57:16 +0000</pubDate>
		<dc:creator>Gretchen</dc:creator>
				<category><![CDATA[ECU]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[the profession]]></category>

		<guid isPermaLink="false">http://digital.lib.ecu.edu/collaboratory/?p=503</guid>
		<description><![CDATA[I had the distinct pleasure of attending three RDA webinars in the last month and found them alternately informative, entertaining and frustrating. Here&#8217;s the lowdown: Informative: The quality and knowledge of the speakers was great. Having a basic understanding of all of the concepts, I was not overwhelmed but got a greater view into some [...]]]></description>
			<content:encoded><![CDATA[<div class="wp-caption aligncenter" style="width: 650px"><a href="http://www.flickr.com/photos/wainwright/380578681/"><img title="Alphabet Soup II" src="http://farm1.static.flickr.com/154/380578681_8ed606f7b0_z.jpg?zz=1" alt="Alphabet Soup II" width="640" height="427" /></a><p class="wp-caption-text">By Flickr user cdw9</p></div>
<p>I had the distinct pleasure of attending three RDA webinars in the last month and found them alternately informative, entertaining and frustrating. Here&#8217;s the lowdown:</p>
<p><strong>Informative: </strong>The quality and knowledge of the speakers was great. Having a basic understanding of all of the concepts, I was not overwhelmed but got a greater view into some specifics of RDA that I was lacking. There has obviously been a lot of attention to RDA in the Tech Services world, but not as much in Digital Collections-y worlds, I guess because we probably don&#8217;t always deal as directly with the big thugs like MARC and AACR2. However, I&#8217;ve been wondering for a while now why more digital libraries (especially open-source and homegrown ones like ours) are doing more experiments with RDA, RDF, and especially linked data. Here at ECU we grab a lot of records from the catalog and cataloguers create the metadata in our MODS/METS records, so we are not immune to this change. I am really anxious for us to do something in this vein. The possibilities for broadening the use of our repository and for enriching it are just too good to ignore.<span id="more-503"></span></p>
<p><strong>Entertaining:</strong> Again while the presenters were great, the entertaining part was to be found in the simultaneous chat. Now I don&#8217;t want to be uncharitable towards my colleagues, but there were comments there that were so clichéd and clueless I thought it had to be sarcasm. I&#8217;m not trying to stifle debate or legitimate dissent, but arguing that maybe libraries purposely SHOULDN&#8217;T keep up with the times to preserve our &#8220;ambience&#8221; and saying in one breath that cataloguers were too busy to learn new things and in the next that they were afraid they were no longer needed (but of course the idea of learning to skills to remain viable in a new information environment was out of the question), was astonishing to me. However, the entertainment soon led to&#8230;</p>
<p><strong>Frustrating:</strong> I&#8217;m not the <a href="http://dltj.org/article/thursday-threads-2010w44/">only</a> <a href="http://twitter.com/#!/Metadata_Maven">one</a> in library-land to have ended up slightly bewildered by the reaction at this webinar. My own frustration had a few different sources:</p>
<ol>
<li>It was unfortunate that many of the participants did not have a firm grasp of concepts like FRBR, the difference between data standards (XML, RDF) and content standards (RDA, AACR2), and individual metadata standards (Dublin Core). The webinar introduced concepts but didn&#8217;t cover them in depth and the resulting confusion led to a lot of misinterpretation.</li>
<li>I wished that there were separate webinars for those with differing levels of expertise, and with different job functions. I&#8217;m all for cataloguers having a basic understanding of data formats and system designers understanding content standards, but perhaps addressing the needs of each separately would have been more productive.</li>
<li>On the other hand, it&#8217;s not like RDA was dreamed up yesterday. If you are a professional in this field and you don&#8217;t get it by now, it&#8217;s not like there haven&#8217;t been plenty of opportunities.</li>
<li>Finally, I think some of the examples of this great new information environment do more harm than good. The attendees get too stuck on individual examples of what <em>could</em> be done, pick apart it&#8217;s flaws, and perceive it as a threat to &#8220;THE CATALOG.&#8221; Meanwhile, the point is to make the catalog BETTER. There were multiple comments on data deluge and the importance of catalogs due to some of the examples used (mashups of data from multiple sources), demonstrating that participants were so caught on that they missed the central message&#8230;that the changes will make catalogs easier to use, for both librarian and user.</li>
</ol>
<p>Of course there is plenty of frustration on the side of the debate, obvious to anyone familiar with the petition <a href="http://www.ipetitions.com/petition/norda/">against the RDA </a>test which grew out of OCLC-CAT discussions. *sigh&#8230;</p>
<p>Be sure to checkout the <em><a href="http://www.alatechsource.org/blog/2010/10/continuing-the-conversation-new-models-of-metadata.html">Continuing</a> <a href="http://www.alatechsource.org/blog/2010/11/continuing-the-conversation-rda-designed-for-current-and-future-environments.html">the</a> <a href="http://www.alatechsource.org/blog/2010/11/continuing-the-conversation-rda-vocabularies-in-the-semantic-web.html">Conversation</a></em> posts about the webinars at the ALA Techsource blog.</p>
]]></content:encoded>
			<wfw:commentRss>http://digital.lib.ecu.edu/collaboratory/?feed=rss2&amp;p=503</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>Commenting in the Digital Archive</title>
		<link>http://digital.lib.ecu.edu/collaboratory/?p=375</link>
		<comments>http://digital.lib.ecu.edu/collaboratory/?p=375#comments</comments>
		<pubDate>Thu, 25 Mar 2010 20:51:56 +0000</pubDate>
		<dc:creator>Gretchen</dc:creator>
				<category><![CDATA[description]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[the web]]></category>

		<guid isPermaLink="false">http://digital.lib.ecu.edu/collaboratory/?p=375</guid>
		<description><![CDATA[I just found a really interesting presentation (while I was admittedly doing a little vanity-googling for mentions of Digital Collections&#8230;I&#8217;m only human!): Let Me Tell You about My Grandpa: a Content Analysis of User Annotations to Online Archival Collections by Jessica Sedgwick, Archivist for Women in Medicine at Harvard Medical School. (I&#8217;ve just been informed, [...]]]></description>
			<content:encoded><![CDATA[<p>I just found a really interesting presentation (while I was admittedly doing a little vanity-googling for mentions of <a href="http://digital.lib.ecu.edu">Digital Collections</a>&#8230;I&#8217;m only human!):</p>
<p><a href="http://www.slideshare.net/jmsedgwick/let-me-tell-you-about-my-grandpa-a-content-analysis-of-user-annotations-to-online-archival-collections">Let Me Tell You about My Grandpa: a Content Analysis of User Annotations to Online Archival Collections</a> by Jessica Sedgwick, Archivist for Women in Medicine at Harvard Medical School.</p>
<p>(I&#8217;ve just been informed, as well, that this presentation was actually given at SAA last year)&#8230; </p>
<p>The research presented investigates how users interact with digital archives when they are given the facility to comment. The results of her analysis are interesting. While I’ve observed the same things, I’ve never thought about quantifying it like this. I think it’s also interesting that the site with the sparsest metadata had the highest number of comments, but the site with the most metadata had the highest proportion of corrections.</p>
<p>Anyway, just for your reading pleasure…</p>
]]></content:encoded>
			<wfw:commentRss>http://digital.lib.ecu.edu/collaboratory/?feed=rss2&amp;p=375</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>On the Inside, Looking Out</title>
		<link>http://digital.lib.ecu.edu/collaboratory/?p=355</link>
		<comments>http://digital.lib.ecu.edu/collaboratory/?p=355#comments</comments>
		<pubDate>Wed, 10 Mar 2010 19:45:32 +0000</pubDate>
		<dc:creator>gregoryl</dc:creator>
				<category><![CDATA[State Library of North Carolina]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[metadata]]></category>

		<guid isPermaLink="false">http://digital.lib.ecu.edu/collaboratory/?p=355</guid>
		<description><![CDATA[This is my first post here, so I’ll briefly introduce myself. Since November 2009, I’ve been the Digital Projects Liaison at the State Library of North Carolina. I get to work on a spectrum of things, including preservation, web site archiving, and digital collection building. Recently, I’ve been mulling over a collection we’re in the [...]]]></description>
			<content:encoded><![CDATA[<p>This is my first post here, so I’ll briefly introduce myself.  Since November 2009, I’ve been the Digital Projects Liaison at the State Library of North Carolina. I get to work on a spectrum of things, including preservation, web site archiving, and digital collection building.</p>
<p>Recently, I’ve been mulling over a collection we’re in the process of digitizing through our <a href="http://statelibrary.ncdcr.gov/digidocs/index.html" target="_blank">ASGII grant</a>.  It’s a group of 1960s urban development reports from our state publications collection.  Partially funded through a federal urban planning grant under the <a href="http://www.hud.gov/offices/adm/about/admguide/history.cfm#1950">Housing Act of 1954</a>, these reports were the product of efforts by the Division of Community Planning under the North Carolina Department of Conservation (now the <a href="http://www.enr.state.nc.us/">Department of Environment and Natural Resources</a>).  These reports deal with local areas – counties, towns, and even lakes or neighborhoods. They have titles like “Land use plan, Goldsboro, North Carolina” and “Population and economy of Forest City, N.C.”</p>
<p>Have your eyes glazed over yet? <span id="more-355"></span> Here’s the issue – these reports are great! Urban planners went out and scrutinized a community, reporting back with demographics, maps, photographs, personal interviews, and projections for the future that included jaunty stick figures walking down updated main streets.  The reports talk about areas of urban blight, have high level surveys of historic buildings, and even include soil and traffic maps.  If you want to see what downtown Graham was like in 1967 (someone does, right?) and what they predicted for that area at that point in time, these reports are a valuable resource.  The materials inside could really be of interest to genealogists, archaeologists, historians, local leaders – all user groups whose eyes will probably slide right over the words “land use plan.”</p>
<p>So what have I been mulling over? How do we draw people into a collection like this?  More specifically:  with mass digitization, or even moderately-large-scale digitization, how do we leverage information inside of digital objects when the usual amount of metadata won’t cut it?  In some ways, this is the traditional archival processing conundrum of item-level vs. folder-level vs. collection-level description.  But I’m not trying to justify item-level description here, I’m trying to bend my brain to think of ways we can draw the inside information out.</p>
<p>My first thoughts centered on doing some TLD (tender loving description) on the maps, which represent a pretty significant and useful subset of pages within these reports.  But that sort of care isn’t something I believe should be part of our priorities here. Then I thought “What I really need is more progressive OCR and presentation of digitized statistics.”  This sort of technology development is pretty well beyond my current capabilities and resources.  Finally, I pondered the current go-to idea for digital librarians:  tagging and other social phenomena that shift some of the description burden to the users.  While interesting in many respects, it wouldn’t get me the high quality, across-the-board coverage I’m after (assuming our users would even help us out).</p>
<p>As you can see, my first reaction is to throw more words at this challenge.  I think we sometimes try to tack on as much metadata as we can, as quickly as we can, all in the hopes of vying for fickle user attention.  I’m not sure throwing words at the items or the users is truly the most effective way to make inroads into our digital collections, but I certainly haven’t figured out what the most effective way is.  I’m still negotiating the line between a level of curatorial care that I don’t believe we can or should maintain, and the library equivalent of those web pages that simply load their white space with words to catch the search engines.  I’m sure these urban development reports won’t be the last things I see that make me think “There’s great stuff in here!”  I’m sure some librarian standing in front of a stack of scrolls thought the same thing.  What I’m not sure about, but am still working on, is how to get that inside information out.</p>
]]></content:encoded>
			<wfw:commentRss>http://digital.lib.ecu.edu/collaboratory/?feed=rss2&amp;p=355</wfw:commentRss>
		<slash:comments>31</slash:comments>
		</item>
		<item>
		<title>NC ECHO Metadata Initiatives and Sharing Metadata</title>
		<link>http://digital.lib.ecu.edu/collaboratory/?p=193</link>
		<comments>http://digital.lib.ecu.edu/collaboratory/?p=193#comments</comments>
		<pubDate>Mon, 01 Jun 2009 20:01:11 +0000</pubDate>
		<dc:creator>Nick</dc:creator>
				<category><![CDATA[metadata]]></category>

		<guid isPermaLink="false">http://digital.lib.ecu.edu/collaboratory/?p=193</guid>
		<description><![CDATA[Mark just posted about EAC and NCBHIO. North Carolina Biographical and Historical Information Online is one of the great metadata initiatives led and administered by NC ECHO. With recent and looming state budget cuts, I know the status and arrangement of NC ECHO is uncertain at the moment, but the standards and guides collected and [...]]]></description>
			<content:encoded><![CDATA[<p>Mark just posted <a href="http://joyner1302.wordpress.com/2009/06/01/gnu-wget-ncbhio/">about EAC and NCBHIO</a>.  North Carolina Biographical and Historical Information Online is one of the great <a href="http://www.ncecho.org/dig/index.shtml">metadata initiatives</a> led and administered by NC ECHO.  With recent and looming state budget cuts, I know the status and arrangement of NC ECHO is uncertain at the moment, but the standards and guides collected and interpreted by ECHO staff and published on their website remain tremendously useful.  <span id="more-193"></span></p>
<p>I think the simple, sensible <a href="http://www.ncecho.org/dig/ncdc.shtml">North Carolina Dublin Core</a> schema is a great starting place for any discussion about sharing metadata and digital objects among repositories in the state.  Is anyone actively mapping your fields to this?  (Then again, the schema is simple enough that it could be done after the fact, but I&#8217;m curious anyway about whether or not people are looking at NCDC.)</p>
<p>Which leads to my next question: who&#8217;s sharing metadata, and who are you sharing it with?  Some UNC collections (but not all, at least not yet) are harvested by OAI, which is fantastic, though OAI, like WorldCat, has the potential to get so large that only pretty specific searches or very patient browsing are rewarded.  It would be great if, in addition to OAI and other national union catalogs, we could share metadata among North Carolina digital collections.  Many of us are publishing materials that strongly complement (and in a few instances, duplicate) the work that others are doing.  If we just shared among ourselves (that is, collections focusing on North Carolina content), would we be able to use a shared schema that is more robust than some of the <a href="http://digital.lib.ecu.edu/collaboratory/?p=118">national initiatives</a> Gretchen mentioned earlier?  I&#8217;m not suggesting we try to duplicate or avoid some of the bigger efforts, but if we start amongst ourselves, would that be more achievable, or would we be better off just sticking with the existing/emerging standards already out there?</p>
]]></content:encoded>
			<wfw:commentRss>http://digital.lib.ecu.edu/collaboratory/?feed=rss2&amp;p=193</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Metadata Madness</title>
		<link>http://digital.lib.ecu.edu/collaboratory/?p=164</link>
		<comments>http://digital.lib.ecu.edu/collaboratory/?p=164#comments</comments>
		<pubDate>Tue, 05 May 2009 12:41:27 +0000</pubDate>
		<dc:creator>Nick</dc:creator>
				<category><![CDATA[metadata]]></category>
		<category><![CDATA[tools]]></category>

		<guid isPermaLink="false">http://digital.lib.ecu.edu/collaboratory/?p=164</guid>
		<description><![CDATA[Wrangling and taming metadata is a big part of my job. In the North Carolina Maps project, metadata comes from (at least) two different institutions, which use different schema and send things along to me in different formats. Not only does each record pass through many hands along the way, it also moves through a [...]]]></description>
			<content:encoded><![CDATA[<p>Wrangling and taming metadata is a big part of my job.  In the <a href="http://www.lib.unc.edu/dc/ncmaps/">North Carolina Maps</a> project, metadata comes from (at least) two different institutions, which use different schema and send things along to me in different formats.  Not only does each record pass through many hands along the way, it also moves through a variety of different tools, spending time at one point or another in Excel, Oxygen, GoogleDocs, and CONTENTdm.  It&#8217;s a workflow that works fine for us for now, but I&#8217;m interested in hearing how others manage metadata for digital projects, in particular,</p>
<p><span id="more-164"></span></p>
<ul>
<li>how do you move from an initial catalog record (such as MARC or EAD) to your final metadata format, and is the record enhanced along the way?</li>
<li>for metadata that&#8217;s created from scratch (or from a non-digitized record, such as a catalog card), where&#8217;s the first place data is entered?</li>
<li>how do you incorporate metadata from different departments in your institution (such as the descriptive metadata that might come from cataloging and preservaton metadata from digitization staff)?</li>
<li>and, for those of you working on collaborative projects, how do you bring in metadata from partners?</li>
</ul>
<p>I don&#8217;t mean this to sound like a formal survey as much as just a place to start the discussion.  I know we all want to end with elegant, portable METS records (if we&#8217;re using METS), but I suspect we all take different routes to get there.</p>
]]></content:encoded>
			<wfw:commentRss>http://digital.lib.ecu.edu/collaboratory/?feed=rss2&amp;p=164</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>What Flickr portends, in the long-(tail)-run</title>
		<link>http://digital.lib.ecu.edu/collaboratory/?p=59</link>
		<comments>http://digital.lib.ecu.edu/collaboratory/?p=59#comments</comments>
		<pubDate>Wed, 11 Mar 2009 18:37:36 +0000</pubDate>
		<dc:creator>Mark</dc:creator>
				<category><![CDATA[ECU]]></category>
		<category><![CDATA[Flickr]]></category>
		<category><![CDATA[SAA]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[Hathi Trust]]></category>
		<category><![CDATA[Special Collections]]></category>

		<guid isPermaLink="false">http://digital.lib.ecu.edu/collaboratory/?p=59</guid>
		<description><![CDATA[I&#8217;ve been meaning to write about my experience with Flickr for some time, but in some ways it&#8217;s better that this is following Brian&#8217;s excellent post. My best guess, I&#8217;ll plainly state, is that in the wake of Google&#8217;s mass digitization of published works (books, magazines, journals), major research libraries will only be distinguishable by [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been meaning to write about my experience with Flickr for some time, but in some ways it&#8217;s better that this is following Brian&#8217;s excellent post. My best guess, I&#8217;ll plainly state, is that in the wake of Google&#8217;s mass digitization of published works (books, magazines, journals), major research libraries will only be distinguishable by three things:<br />
<span id="more-59"></span></p>
<ol>
<li>their architecture (physical)</li>
<li>their librarians (departments)</li>
<li>their collections (special -&gt; digital)</li>
</ol>
<p>The most flexible and adaptable of these differentiating factors, then, will most likely be our websites and digitized items (eventually, perhaps, our “linked data”). Certainly the physical architecture of the libraries will change most slowly, but with an average website “face-lift” occurring &#8212; let&#8217;s say &#8212; every three years, presently it may actually be the case that the average time frame of personnel shift amongst libraries/departments is approximately equal to any major attention lavished on the website and/or the addition/migration of major digitized collections. But soon, surely, it will be indisputable that it&#8217;s our digital collections alone that will be our most adaptable differentiating factor.</p>
<p>As such, it will equally be the most powerful method to level the variances between our libraries (think “open source software”) and, paradoxically, also the most powerful method to differentiate our libraries (think local developments on top of “open source software”). Furthermore, though our digitized surrogates will certainly retain the provenance of their real-world counterparts by means of metadata, my contention is that in the eventual wake of the mass digitization of special collections, major research libraries will only be distinguishable by &#8212; not three &#8212; but just two things:</p>
<ol>
<li>their architecture</li>
<li>their librarians</li>
</ol>
<p>And so, as our most unique resources become de-referenceable, these acts will slowly but eventually erase all differences in our collections (barring, of course, or perhaps only being slowed by, forced authentication and registration of online “researchers”) and return libraries to a pre-website state.</p>
<p>Rather than being feared, however, this should be our ultimate goal. Though universities will certainly retain their differences and specialties, the special collections of the libraries (those available online) will become the collections of all (of course, libraries will still need to maintain these collections, but it&#8217;s likely that they&#8217;ll perform this as a shared and proportionally balanced duty).</p>
<p>I should clarify, though: being returned to a “pre-website state” will not mean that a library will suddenly cease to exist online. Rather, the library&#8217;s online existence will no longer be controlled and contained, at least not in the same manner (and the site, as we know it now, might simply not need to exist).   And this, finally, brings me back to Flickr, a small but timely example of the changes ahead.</p>
<p>But first, I should say, I had little to no experience with Flickr when setting out to include a sample of ECU&#8217;s digital collections  (<a href="http://www.flickr.com/photos/ecu_digital_collections">http://www.flickr.com/photos/ecu_digital_collections</a>). I mention this at the onset to highlight Flickr&#8217;s ease of use and, also, the potential reasoning behind my false steps along the way, especially since I ended up working through the necessary steps in, literally, a backward fashion.</p>
<p>What follows is a bullet-list summary of my experience in the process so far (and please feel free to ask me about any particulars).</p>
<p><strong>Basic steps:</strong></p>
<ul>
<li>Requested a free, non-commericial API from Flickr (<a href="http://www.flickr.com/services/api/">http://www.flickr.com/services/api/</a>)</li>
<li>Selected public domain/copyright free items to share</li>
<li>Batched and resized images to save space (wanted to be under the free account&#8217;s 100 MB/month cap)</li>
<li>Wrote an XSLT stylsheet to remove just a few pieces of data from our METS record
<ol>
<li>Title</li>
<li>Description</li>
<li>Date</li>
<li>Persistent URL</li>
</ol>
</li>
<li>Added just one or two tags to each image based on &#8220;format&#8221; and &#8220;virtual collection&#8221; (opted not to add our subjects as tags or to display them at all)</li>
<li>Used a text editor to take the output and put into a Python file, generally making sure that I wrapped strings in triple quotes</li>
<li>Finally, I used &#8220;<a href="http://stuvel.eu/projects/flickrapi">Beej&#8217;s Python Flickr API kit</a>&#8221; for the mass upload and user authentication</li>
</ul>
<p><strong>Next steps:</strong></p>
<ul>
<li>Update my old images with &#8220;<a href="http://www.flickr.com/groups/api/discuss/72157594497877875/">machine tags</a>&#8220;, as I neglected this in my first round</li>
<li>Add a new set (the free account lets you have up to three)</li>
<li>Get the Pro account</li>
<li>Use the API to funnel the comments and tags from Flickr to our site, thereby linking user-supplied data from this <a href="http://www.flickr.com/photos/ecu_digital_collections/3289183842/">Flickr image</a> to that <a href="http://digital.lib.ecu.edu/5053">same image</a> in our digital repository, which doesn&#8217;t yet have that comment or tag.</li>
<li>Join <a href="http://www.flickr.com/commons/">The Commons on Flickr</a> if approved</li>
</ul>
<p><strong>And things to consider if you set up a Flickr account:</strong></p>
<ul>
<li>Don&#8217;t select a birthday that would make you under 18, or else you&#8217;ll need to use a credit card to verify parental consent (as if that alone won&#8217;t make everyone underage grow up quite quickly)</li>
<li>If you&#8217;re using a free account, remember that though you&#8217;re able to upload 100MB a month, only your 200 most recent images will be displayed</li>
</ul>
<p><strong>Some interesting Library/Flickr articles to read:</strong></p>
<ul>
<li><a href="http://www.nytimes.com/2009/01/19/technology/internet/19link.html">Historical Photos in Web Archives Gain Vivid New Lives</a> by Noam Cohen</li>
<li><a href="http://journal.code4lib.org/articles/74">Developing an Academic Image Collection with Flickr</a> by Jeremy McWilliams</li>
</ul>
<p><strong>And finally:</strong></p>
<p>As our collections continue to disperse in the digital age, and as our collections begin to arrive in a predominately digital format, it&#8217;s my hope that we&#8217;ll be able to receive donations that can be shared online without restrictions.</p>
<p>At the same time, SAA and archivists in general will need to vigilantly campaign against the potential misappropriation of our collections.  Though commercial partners and vendors might not offer to digitize our special collections for free, they will certainly come knocking on the doors soon enough, explaining that they can host our vast collections; but in this respect I hope that we do not yield.  If we do, we&#8217;ll surely trap not only our libraries, but also our users; and, being a trap, the situation won&#8217;t come equipped with something as user-friendly as the Flicrk API to permit the free passport of our collections and users.  Certainly, then, there&#8217;s no reason that we shouldn&#8217;t already be imagining the equivalent of the <a href="http://www.hathitrust.org/">Hathi Trust</a> for our special collection materials.</p>
]]></content:encoded>
			<wfw:commentRss>http://digital.lib.ecu.edu/collaboratory/?feed=rss2&amp;p=59</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Launching a New Repository at ECU</title>
		<link>http://digital.lib.ecu.edu/collaboratory/?p=31</link>
		<comments>http://digital.lib.ecu.edu/collaboratory/?p=31#comments</comments>
		<pubDate>Sun, 01 Feb 2009 21:00:58 +0000</pubDate>
		<dc:creator>Gretchen</dc:creator>
				<category><![CDATA[ECU]]></category>
		<category><![CDATA[SAA]]></category>
		<category><![CDATA[State Library of North Carolina]]></category>
		<category><![CDATA[institutional repositories]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[repositories]]></category>

		<guid isPermaLink="false">http://digital.lib.ecu.edu/collaboratory/?p=31</guid>
		<description><![CDATA[I began working just over a year ago here at ECU, and we have finally finished our digital repository &#8212; a project that was actually begun before I started and which I never thought would take this long to complete. In retrospect though, I guess I should have based on the two years of labor [...]]]></description>
			<content:encoded><![CDATA[<p>I began working just over a year ago here at ECU, and we have finally finished our <a href="http://digital.lib.ecu.edu">digital repository</a> &#8212; a project that was actually begun before I started and which I never thought would take this long to complete. In retrospect though, I guess I should have based on the two years of labor it took to create the <a href="http://lib.umd.edu/digital">Fedora-based repository at the University of Maryland</a>. I had always assumed that experience had taken so long because of the organizational difficulties there (i.e. not having a dedicated programmer on the project and in the department that was in charge of it). Yet, it took almost as long here.<span id="more-31"></span></p>
<p>I now realize that nearly everything you do like this is going to take a long time, if: a) you want to do it right and b) your institution won&#8217;t quit asking you to do <a href="http://thescholarship.ecu.edu">other projects</a>!</p>
<p>All joking aside, we are enormously proud of our repository. The guiding principles were:</p>
<ul>
<li>one common database and infrastructure that could be utilized for multiple &#8220;collections&#8221;</li>
<li>Collections are nice and we do like them, but we will always have &#8220;uncollected&#8221; items in there and we need to design it so that you can still discover these</li>
<li>This repository will gather everything digitized for a patron request, for an exhibit, for a book, for preservation, or because it would be cool to digitize it</li>
</ul>
<p>Based on the above principles we used a back-end XML database (TextML) and a METS/MODS record for every object. In addition, we included Dublin Core records in the METS (scripted from the MODS) for the purposes of OAI harvesting, appropriate admin metatadata (MIX in most cases since we have a lot of images), and even the full text of TEI records when those were available. To date we only have one audio object, but should be ingesting another 20 or so by March.</p>
<p>We had the details of the backend database worked out in the first third of 2008, and then set ourselves to the task of figuring out how to present this stuff to people who would have no idea what the site was, and in many cases, who ECU and Joyner Library were. We took the approach that the site had to invite people into the content &#8212; we had to have lots of ways that they could come to page, click on one thing and get to content.</p>
<p>So we created &#8220;<a href="http://digital.lib.ecu.edu/subject.aspx">subject clouds</a>&#8221; and created <a href="http://digital.lib.ecu.edu/collections.aspx">collections</a> post-hoc (i.e. we picked some relevant themes, then looked at what we had and organized them together). Both of these things were created on the premise that the organization of physical items in the library isn&#8217;t necessarily the best organization for things on the web. The web can do both more and less than that.</p>
<p>We also provided more tools for navigating through materials once you found some by including <a href="http://digital.lib.ecu.edu/263">hyperlinks in the records</a>, and a <a href="http://digital.lib.ecu.edu/search.aspx?q=Tobacco&amp;index=subjects">faceted drill-down of the subject headings</a>. These, as well as the subject cloud and the organization of the post-hoc collections, were all based on LCSH subject headings. A simple look at size of the facet list or the subject cloud gives you a good indication of both the benefits and drawbacks of this approach. But we figured we would use what we had and I think in some sense it is successful.</p>
<p>What is most interesting to me, is that many of these design elements were things that I&#8217;ve been thinking about since my days at UM. Being able to realize them here with a fresh new project was really exciting. Certainly, it isn&#8217;t perfect, but I think it&#8217;s all too rare in our jobs that we get to come up with an idea, implement it, and then evaluate how it worked out. We are often saddled with maintaining past projects and living within the bounds of past decisions. Of course, admitting that means that I must acknowledge that one day the decisions we&#8217;ve made on this project will limit us somewhere else. But I think we&#8217;ve created enough leeway and enough to think about, that we should be busy for quite some time.</p>
]]></content:encoded>
			<wfw:commentRss>http://digital.lib.ecu.edu/collaboratory/?feed=rss2&amp;p=31</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>
