DuraSpace – A Foggy Notion?
Tags: cloud, dspace, duraspace, fedoraSpent an hour or so today participating in a WebEx presentation hosted by Sun, the DSpace Foundation and the Fedora Commons. Billed as “A Collaboration Update” this event basically served to offer an introduction, rationale and development path for DuraSpace (sounds familiar doesn’t it?).
Where to begin? On some level, I guess the headline of today’s session was:
Trusted Repositories are So Yesterday. Welcome to the Cloud
I use cloud storage today to store extra backup copies of some of our library websites and other data but I don’t think of it as an overly reliable solution. At least, not in the sense that it’s the only place my data lives. That sort of dependability isn’t the IT problem the cloud was created to solve. If you doubt that, take a look at the Service Level Agreement Amazon offers for their S3 service. It’s quite specific in some areas but vague when it comes to total data loss:
If availability is impacted by factors other than those used in our calculation of the Error Rate, we may issue a Service Credit considering such factors in our sole discretion.
Sorry, your data’s gone. But we will give you a credit in case you want to give us some more data. Comforting, isn’t it?
It’s possible that DuraSpace might eventually morph into something useful:
- maybe a LOCKSS-inspired dispersal of IR content across multiple cloud-based systems
- or a cloud-based aggregation of all the world’s repository content under a common set of APIs
but that doesn’t seem to be the current focus. Instead, there was prominent mention made of “value-add” services that could be performed on the data coming from your site on its way to the cloud. And I also heard an allusion to the fact that with DuraSpace those sites that have run out of storage or computer power will now have an option for their continued existence. I didn’t find either terribly convincing. I can see that it might be nice to have my data “improved” on it’s way to the cloud but wouldn’t I then want to replicate those improvements on my local copy of the data? And given today’s pricing, what site running DSpace can’t afford another terabyte drive?
While it may be buzzword-compliant, I’m afraid at this point DuraSpace sounds like a “let’s try to monitize our involvement in this thing” solution searching for a problem. Maybe I missed the point. I hope so. I also hope that the development of DuraSpace won’t steal too many cycles from those currently working on improving the core software assets of the DSpace and Fedora projects.
|
|
|
|
Comments(2)
It looks like you missed the whole premise of why we are looking to develop DuraSpace. IT IS NOT INTENDED TO REPLACE THE REPOSITORY by any means. It is a web based meditating service where you can replicate your content to multiple cloud providers- therefore putting your data at multiple geographies and multiple administrations. We believe this is of interest to organizations trying to make more than a single copy within their own environment. The concept is similar to lockks with the exception you do not have to run a lockks server, most likely it will be cheaper, and lockks does not scale for large files ( ie ARC files). It will not take away from the development of DSpace of Fedora, hopefully it will help better fund some of that development- but that is yet to be proven. the value added services are other things you can do with COPIES of you data once it is in the clould- YES which will sync back to your primary source data at your repository. For example- run the JHOVE validation tool, run migration format tools ect.
Hope that helps clarify some of the information.
Well, I’m glad to hear that I missed the point of DuraSpace and glad to see that you aren’t expecting it to take away from development of DSpace.
I’ll take some of the responsibility for getting it wrong but I was listening carefully. I’ll suggest you focus the presentation more on the benefits to be derived from DuraSpace and less on the fact that cloud storage is being used. The “cloud” angle really isn’t much of a selling point since cloud storage is already there for anyone who wants to use it.