<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>North Carolina Digital Collections Collaboratory &#187; SAA</title>
	<atom:link href="http://digital.lib.ecu.edu/collaboratory/index.php?feed=rss2&#038;cat=7" rel="self" type="application/rss+xml" />
	<link>http://digital.lib.ecu.edu/collaboratory</link>
	<description>Bringing North Carolina Digital Collections Together</description>
	<lastBuildDate>Wed, 26 Jan 2011 20:56:04 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0.1</generator>
		<item>
		<title>What Flickr portends, in the long-(tail)-run</title>
		<link>http://digital.lib.ecu.edu/collaboratory/?p=59</link>
		<comments>http://digital.lib.ecu.edu/collaboratory/?p=59#comments</comments>
		<pubDate>Wed, 11 Mar 2009 18:37:36 +0000</pubDate>
		<dc:creator>Mark</dc:creator>
				<category><![CDATA[ECU]]></category>
		<category><![CDATA[Flickr]]></category>
		<category><![CDATA[SAA]]></category>
		<category><![CDATA[digital preservation]]></category>
		<category><![CDATA[digitization]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[Hathi Trust]]></category>
		<category><![CDATA[Special Collections]]></category>

		<guid isPermaLink="false">http://digital.lib.ecu.edu/collaboratory/?p=59</guid>
		<description><![CDATA[I&#8217;ve been meaning to write about my experience with Flickr for some time, but in some ways it&#8217;s better that this is following Brian&#8217;s excellent post. My best guess, I&#8217;ll plainly state, is that in the wake of Google&#8217;s mass digitization of published works (books, magazines, journals), major research libraries will only be distinguishable by [...]]]></description>
			<content:encoded><![CDATA[<p>I&#8217;ve been meaning to write about my experience with Flickr for some time, but in some ways it&#8217;s better that this is following Brian&#8217;s excellent post. My best guess, I&#8217;ll plainly state, is that in the wake of Google&#8217;s mass digitization of published works (books, magazines, journals), major research libraries will only be distinguishable by three things:<br />
<span id="more-59"></span></p>
<ol>
<li>their architecture (physical)</li>
<li>their librarians (departments)</li>
<li>their collections (special -&gt; digital)</li>
</ol>
<p>The most flexible and adaptable of these differentiating factors, then, will most likely be our websites and digitized items (eventually, perhaps, our “linked data”). Certainly the physical architecture of the libraries will change most slowly, but with an average website “face-lift” occurring &#8212; let&#8217;s say &#8212; every three years, presently it may actually be the case that the average time frame of personnel shift amongst libraries/departments is approximately equal to any major attention lavished on the website and/or the addition/migration of major digitized collections. But soon, surely, it will be indisputable that it&#8217;s our digital collections alone that will be our most adaptable differentiating factor.</p>
<p>As such, it will equally be the most powerful method to level the variances between our libraries (think “open source software”) and, paradoxically, also the most powerful method to differentiate our libraries (think local developments on top of “open source software”). Furthermore, though our digitized surrogates will certainly retain the provenance of their real-world counterparts by means of metadata, my contention is that in the eventual wake of the mass digitization of special collections, major research libraries will only be distinguishable by &#8212; not three &#8212; but just two things:</p>
<ol>
<li>their architecture</li>
<li>their librarians</li>
</ol>
<p>And so, as our most unique resources become de-referenceable, these acts will slowly but eventually erase all differences in our collections (barring, of course, or perhaps only being slowed by, forced authentication and registration of online “researchers”) and return libraries to a pre-website state.</p>
<p>Rather than being feared, however, this should be our ultimate goal. Though universities will certainly retain their differences and specialties, the special collections of the libraries (those available online) will become the collections of all (of course, libraries will still need to maintain these collections, but it&#8217;s likely that they&#8217;ll perform this as a shared and proportionally balanced duty).</p>
<p>I should clarify, though: being returned to a “pre-website state” will not mean that a library will suddenly cease to exist online. Rather, the library&#8217;s online existence will no longer be controlled and contained, at least not in the same manner (and the site, as we know it now, might simply not need to exist).   And this, finally, brings me back to Flickr, a small but timely example of the changes ahead.</p>
<p>But first, I should say, I had little to no experience with Flickr when setting out to include a sample of ECU&#8217;s digital collections  (<a href="http://www.flickr.com/photos/ecu_digital_collections">http://www.flickr.com/photos/ecu_digital_collections</a>). I mention this at the onset to highlight Flickr&#8217;s ease of use and, also, the potential reasoning behind my false steps along the way, especially since I ended up working through the necessary steps in, literally, a backward fashion.</p>
<p>What follows is a bullet-list summary of my experience in the process so far (and please feel free to ask me about any particulars).</p>
<p><strong>Basic steps:</strong></p>
<ul>
<li>Requested a free, non-commericial API from Flickr (<a href="http://www.flickr.com/services/api/">http://www.flickr.com/services/api/</a>)</li>
<li>Selected public domain/copyright free items to share</li>
<li>Batched and resized images to save space (wanted to be under the free account&#8217;s 100 MB/month cap)</li>
<li>Wrote an XSLT stylsheet to remove just a few pieces of data from our METS record
<ol>
<li>Title</li>
<li>Description</li>
<li>Date</li>
<li>Persistent URL</li>
</ol>
</li>
<li>Added just one or two tags to each image based on &#8220;format&#8221; and &#8220;virtual collection&#8221; (opted not to add our subjects as tags or to display them at all)</li>
<li>Used a text editor to take the output and put into a Python file, generally making sure that I wrapped strings in triple quotes</li>
<li>Finally, I used &#8220;<a href="http://stuvel.eu/projects/flickrapi">Beej&#8217;s Python Flickr API kit</a>&#8221; for the mass upload and user authentication</li>
</ul>
<p><strong>Next steps:</strong></p>
<ul>
<li>Update my old images with &#8220;<a href="http://www.flickr.com/groups/api/discuss/72157594497877875/">machine tags</a>&#8220;, as I neglected this in my first round</li>
<li>Add a new set (the free account lets you have up to three)</li>
<li>Get the Pro account</li>
<li>Use the API to funnel the comments and tags from Flickr to our site, thereby linking user-supplied data from this <a href="http://www.flickr.com/photos/ecu_digital_collections/3289183842/">Flickr image</a> to that <a href="http://digital.lib.ecu.edu/5053">same image</a> in our digital repository, which doesn&#8217;t yet have that comment or tag.</li>
<li>Join <a href="http://www.flickr.com/commons/">The Commons on Flickr</a> if approved</li>
</ul>
<p><strong>And things to consider if you set up a Flickr account:</strong></p>
<ul>
<li>Don&#8217;t select a birthday that would make you under 18, or else you&#8217;ll need to use a credit card to verify parental consent (as if that alone won&#8217;t make everyone underage grow up quite quickly)</li>
<li>If you&#8217;re using a free account, remember that though you&#8217;re able to upload 100MB a month, only your 200 most recent images will be displayed</li>
</ul>
<p><strong>Some interesting Library/Flickr articles to read:</strong></p>
<ul>
<li><a href="http://www.nytimes.com/2009/01/19/technology/internet/19link.html">Historical Photos in Web Archives Gain Vivid New Lives</a> by Noam Cohen</li>
<li><a href="http://journal.code4lib.org/articles/74">Developing an Academic Image Collection with Flickr</a> by Jeremy McWilliams</li>
</ul>
<p><strong>And finally:</strong></p>
<p>As our collections continue to disperse in the digital age, and as our collections begin to arrive in a predominately digital format, it&#8217;s my hope that we&#8217;ll be able to receive donations that can be shared online without restrictions.</p>
<p>At the same time, SAA and archivists in general will need to vigilantly campaign against the potential misappropriation of our collections.  Though commercial partners and vendors might not offer to digitize our special collections for free, they will certainly come knocking on the doors soon enough, explaining that they can host our vast collections; but in this respect I hope that we do not yield.  If we do, we&#8217;ll surely trap not only our libraries, but also our users; and, being a trap, the situation won&#8217;t come equipped with something as user-friendly as the Flicrk API to permit the free passport of our collections and users.  Certainly, then, there&#8217;s no reason that we shouldn&#8217;t already be imagining the equivalent of the <a href="http://www.hathitrust.org/">Hathi Trust</a> for our special collection materials.</p>
]]></content:encoded>
			<wfw:commentRss>http://digital.lib.ecu.edu/collaboratory/?feed=rss2&amp;p=59</wfw:commentRss>
		<slash:comments>5</slash:comments>
		</item>
		<item>
		<title>Launching a New Repository at ECU</title>
		<link>http://digital.lib.ecu.edu/collaboratory/?p=31</link>
		<comments>http://digital.lib.ecu.edu/collaboratory/?p=31#comments</comments>
		<pubDate>Sun, 01 Feb 2009 21:00:58 +0000</pubDate>
		<dc:creator>Gretchen</dc:creator>
				<category><![CDATA[ECU]]></category>
		<category><![CDATA[SAA]]></category>
		<category><![CDATA[State Library of North Carolina]]></category>
		<category><![CDATA[institutional repositories]]></category>
		<category><![CDATA[metadata]]></category>
		<category><![CDATA[repositories]]></category>

		<guid isPermaLink="false">http://digital.lib.ecu.edu/collaboratory/?p=31</guid>
		<description><![CDATA[I began working just over a year ago here at ECU, and we have finally finished our digital repository &#8212; a project that was actually begun before I started and which I never thought would take this long to complete. In retrospect though, I guess I should have based on the two years of labor [...]]]></description>
			<content:encoded><![CDATA[<p>I began working just over a year ago here at ECU, and we have finally finished our <a href="http://digital.lib.ecu.edu">digital repository</a> &#8212; a project that was actually begun before I started and which I never thought would take this long to complete. In retrospect though, I guess I should have based on the two years of labor it took to create the <a href="http://lib.umd.edu/digital">Fedora-based repository at the University of Maryland</a>. I had always assumed that experience had taken so long because of the organizational difficulties there (i.e. not having a dedicated programmer on the project and in the department that was in charge of it). Yet, it took almost as long here.<span id="more-31"></span></p>
<p>I now realize that nearly everything you do like this is going to take a long time, if: a) you want to do it right and b) your institution won&#8217;t quit asking you to do <a href="http://thescholarship.ecu.edu">other projects</a>!</p>
<p>All joking aside, we are enormously proud of our repository. The guiding principles were:</p>
<ul>
<li>one common database and infrastructure that could be utilized for multiple &#8220;collections&#8221;</li>
<li>Collections are nice and we do like them, but we will always have &#8220;uncollected&#8221; items in there and we need to design it so that you can still discover these</li>
<li>This repository will gather everything digitized for a patron request, for an exhibit, for a book, for preservation, or because it would be cool to digitize it</li>
</ul>
<p>Based on the above principles we used a back-end XML database (TextML) and a METS/MODS record for every object. In addition, we included Dublin Core records in the METS (scripted from the MODS) for the purposes of OAI harvesting, appropriate admin metatadata (MIX in most cases since we have a lot of images), and even the full text of TEI records when those were available. To date we only have one audio object, but should be ingesting another 20 or so by March.</p>
<p>We had the details of the backend database worked out in the first third of 2008, and then set ourselves to the task of figuring out how to present this stuff to people who would have no idea what the site was, and in many cases, who ECU and Joyner Library were. We took the approach that the site had to invite people into the content &#8212; we had to have lots of ways that they could come to page, click on one thing and get to content.</p>
<p>So we created &#8220;<a href="http://digital.lib.ecu.edu/subject.aspx">subject clouds</a>&#8221; and created <a href="http://digital.lib.ecu.edu/collections.aspx">collections</a> post-hoc (i.e. we picked some relevant themes, then looked at what we had and organized them together). Both of these things were created on the premise that the organization of physical items in the library isn&#8217;t necessarily the best organization for things on the web. The web can do both more and less than that.</p>
<p>We also provided more tools for navigating through materials once you found some by including <a href="http://digital.lib.ecu.edu/263">hyperlinks in the records</a>, and a <a href="http://digital.lib.ecu.edu/search.aspx?q=Tobacco&amp;index=subjects">faceted drill-down of the subject headings</a>. These, as well as the subject cloud and the organization of the post-hoc collections, were all based on LCSH subject headings. A simple look at size of the facet list or the subject cloud gives you a good indication of both the benefits and drawbacks of this approach. But we figured we would use what we had and I think in some sense it is successful.</p>
<p>What is most interesting to me, is that many of these design elements were things that I&#8217;ve been thinking about since my days at UM. Being able to realize them here with a fresh new project was really exciting. Certainly, it isn&#8217;t perfect, but I think it&#8217;s all too rare in our jobs that we get to come up with an idea, implement it, and then evaluate how it worked out. We are often saddled with maintaining past projects and living within the bounds of past decisions. Of course, admitting that means that I must acknowledge that one day the decisions we&#8217;ve made on this project will limit us somewhere else. But I think we&#8217;ve created enough leeway and enough to think about, that we should be busy for quite some time.</p>
]]></content:encoded>
			<wfw:commentRss>http://digital.lib.ecu.edu/collaboratory/?feed=rss2&amp;p=31</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
	</channel>
</rss>
