All items from DuraSpace Blog » 2008 » March

Ithaca, NY A maintenance release that fixes several bugs present in Fedora 2.2.1 is now available. In addition, the release includes key improvements to the journaling and policy enforcement modules. Download the 2.2.2 Release Notes for more detail.
Key improvements include:
• Policy Enforcement
Feature #1743769: Multiple Owner id Support
Fedora’s bundled XACML engine can now act on policies involving objects that have more than one owner. See Fedora Authorization with XACML for more information on configuring and using this feature.
• Journaling
The following improvements enable more sophisticated high availability configurations with Fedora. See the Journaling Guide for more information on configuring and using these features.
• RMI Support
Journal leaders/followers can now be configured to communicate via RMI.
• Multicast Support
Journaling now supports multiple followers via RMI.
• Read-only Mode
Formerly, when configured as a follower, a Fedora repository could not be exposed via API-A or API-M. Now it is possible to set a follower to be exposed via API-A.
• Miscellaneous
Fedora’s out-of-box SSL server certificate (not intended for production use) has been updated with a later expiration date, for convenience.

Aarhus, Denmark DORSDL2 will be held on September 18, 2008, in Aarhus, Denmark. The meeting is being organized in conjunction with the 12th European Conference On Research And Advanced Technology For Digital Libraries.
Digital libraries and digital repositories are–in many ways–two sides of the same coin. This workshop brings together researchers and practitioners from both fields, and aims to transfer knowledge and connect. The target group are primarily repository researchers, developers, and managers. The workshop addresses both experiences and novel concepts with a technological and/or organisational stance.
Issues in the scope of the workshop include:
•Digital object models, behaviors, profiles, and prototypes.
•Relationships between digital objects.
•Interoperability mechanisms - e.g. OAI Object Reuse and Exchange.
•Workflows, cataloging, batch ingestions, versioning, audit trails, etc.
Critical digital library mechanisms.
•Examples of digital object repository systems at work in digital libraries.
•Requirements, models and support for curation and preservation of digital objects.
•Storage layer (e.g. SAN, grid integration).
•Evaluations and comparisons of digital object repository systems with respect to extensibility, flexibility, scalability, preservation, etc.
Digital repository systems and/or digital library systems of interest include:
•FEDORA, Flexible Extensible Digital Object Repository Architecture
•DSpace, Open Source Digital Repository System
•EPrints, Eprints Open Access Repository
• aDORe, A Modular, Standards-Based Digital Object Repository
Note that the above lists should not be considered restrictive. The final program will reflect the most interesting issues for this community.
The workshop aims to cover a variety of practical digital library development issues and how their resolution can (or cannot) be carried out in the context of the digital object repository at hand.
This workshop proposal is related to a workshop at ECDL 2006: Digital Object Repository Systems in Digital Libraries, Alicante, Spain. Issues that have more emphasis in 2008:
•Digital Object Repository Interoperability
•Digital Object relationships
•The storage layer
•Access control in the access layer
Submission of Extended Abstracts
Authors are invited to submit extended abstracts, which should be written in English and prepared according to the Springer LNCS format guidelines (2 pages maximum). Authors are invited to send their manuscripts in PDF format as attachments by email to the DORSDL2 Program Committee by 30 April. Acceptance notification date is 23 May. Please, note that the submission implies the willingness of at least one of the manuscript authors to register and present the paper. Accepted contributions will be published on the workshop’s web site.
Important Dates
30 April: Abstract submission deadline
23 May: Notification date
31 July: Early registration deadline
18 September: Workshop

Ithaca, NY The National Science Digital Library (NSDL) NCore team has announced the release of an open source generic version of the NSDL Search and NDR Interactions MediaWiki extensions and WordPressMU plugins that power the NSDL Expert Voices blogosphere and NSDL Wiki. Download the plugins here:
http://sourceforge.net/project/showfiles.php?group_id=204090
http://wiki.nsdl.org/index.php/NCore
NCore technologies and standards replace NSDL’s original metadata records-based data paradigm allowing for greater flexibility in collaborating and creating context around library resources. In the initiative’s first phase, completed in January 2007, NSDL Core Integration successfully implemented existing library services on top of NCore—a change that was largely invisible to users. Now, NSDL is leveraging what the Fedora-based NCore platform has to offer by introducing next-generation library services and collaborative tools.

Fedora Day at Open Repositories 2008NSDL Annotation will be held on April 3, 2008 and brings 32 Fedora project leaders and developers together to share their work with some of the 400+ OR08 attendees. The Fedora Day Program Committee is chaired by Richard Greene, Manager, RIDIR, REMAP and RepoMMan Projects at the University of Hull. Presentations will include examples of Fedora case studies, preservation and management solutions, architecture, datasets, search, programming issues, front ends, and semantic technologies. Download presentation abstracts here.

San Francisco, CA The Topaz Project hosted developers from Mulgara, Fedora Commons, and Topaz on March 5-6, 2008 for a series of design meetings and discussions based on Mulgara’s XA2 Storage Layer.

Paul Gearon, Mulagara, presented an overview of how Mulgara’s storage currently works. It consists of a “string pool” (mapping numbers to URIs and literals) and a set of six indexes: GSPO, GPOS, GOSP, SPOG, POSG, OSPG. Each of the indexes contains a full copy of the (Subject, Predicate, Object, Graph) quads, just in a different sorted order.

Andrae Muys, Netymon, a specialist supplier of technology development and consultancy services, introduced his ideas on XA2, a redesign of Mulgara’s storage layer. Major goals are to support concurrent writes and up to 100 Billion triples in one instance (two orders of magnitude above what Mulgara can achieve today). Read Muys’ discussion paper:

Six Reasons Why Mulgara’s XA2 Storage Layer Should Matter to You

Muys proposes replacing the string pool with a Trie-based data structure capable of storing 100 billion+ strings. He also suggests a novel approach for the indexes using redundant binary numbers. This approach would reduce the cost of updates to the indexes.

A series of triplestore-related talks hosted by SDForum in Palo Alto. David Wood gave an overview of his company (Zepheira) and talked about Mulgara goals. Andy Palmer (co-founder of Vertica, which uses C-Store) and Sam Madden (co-developer of C-Store at MIT) gave an overview of how Vertica can be used as a read-optimized triplestore.

Ideas for incremental improvements to Mulgara that would improve performance were presented. One idea was to change the string pool so it doesn’t constantly bother with re-allocation of space. This would keep “dirty” data on disk for longer, deferring cleanup to

a later time. Another idea was to use BTrees for the indexes. Currently, Mulgara employs AVLTrees. AVLTrees have the advantage of always being balanced (so they’re always optimized for read performance), but they are relatively expensive to re-balance when

deletes occur. Paul also presented an interesting “out of the box” idea on storing all the indexes as pairs: (subjectID, statementID), (predicateID, statementID), (objectID, statementID), versus triples (subject, predicate, object). This would give every statement in the store a unique ID (think “reification”), but he made a case for why it would actually be more performant than what we have today.

James Leigh introduced the idea of supporting Sesame’s SAIL api in Mulgara. SAIL is similar in purpose to Trippi, but it supports transactions and uses the OpenRDF APIs (as opposed to JRDF, which is falling out of favor). The Mulgara team was generally receptive to the idea. From a technical perspective, it appears the challenge will be mapping the abstract syntax of iTQL to that of SAIL (which is currently the union of SERQL and SPARQL). From a social perspective, there is some concern that the use of SAIL (traditionally

seen as “the Sesame API”) may weaken Mulgara’s brand. However, there is general recognition of the need for a (transactional) JDBC-like API for working with triplestores, and if this API were supported by both Sesame and Mulgara, it would be a win for the community, and would attract more users to Mulgara.

Contributed by Chris Wilper, Lead Software Developer, Fedora Commons

Paris, France The Bibliotheque Nationale de France (BnF) with it’s novel architecture based on the visual concept of four open books standing upright and facing each to make an interior square in downtown Paris, is built on seven centuries of history and scholarship in five locations. Vast collections include rare and ancient texts as well as audio visual collections, image banks and interactive services. BnF has contracted with Atos Origin, an international information technology services company, to create a Fedora-based repository system. A multi-institutional team is planning a Fedora architecture for the BnF project using OAIS reference model standards. For more information contact Thornton Staples.

Stockholm, Sweden The National Library of Sweden will host a Scandinavian Fedora Users Group Meeting on June 24-25, 2008. Meeting details and a call for proposals will follow. Contact Eva Müller for more information.

Ihtaca, NY A paper by Dean B. Krafft, Aaron Birkland, and Ellen J. Cramer that provides details about NCore—an open source architecture and software platform for creating flexible, collaborative digital libraries—is now available. This paper will be presented at the Joint Conference on Digital Libraries (JCDL) 2008 in Pittsburgh, PA.
NCore was developed by the National Science Digital Library (NSDL)NSDL Annotation project, and serves as the central technical infrastructure for NSDL. NCore consists of a central Fedora-based digital repository, a specific data model, an API, and a set of backend services and frontend tools that create a new model for collaborative, contributory digital libraries. This paper describes NCore, presents and analyzes its architecture, tools and services; reports on the experience of NSDL in building and operating a major digital library with NCore during the past year; and the experience of the Digital Library for Earth Systems Education in porting their existing digital library and tools to the NCore platform.

Ithaca, NY Neil Beagrie, a leading expert on digital preservation and curation with an international reputation across the archive, library, science and research sectors in the long-term management of digital assets, will join Sandy Payette, Executive Director, and Eddie Shin, Senior Software Developer, Fedora Commons, in delivering plenary lectures at “Fedora Day”—the Fedora User Group Meeting—at OR08NSDL Annotation in Southampton on April 3, 2008.
Payette and Shin will present an overview of Fedora Commons organizational and technical
progress. Beagrie will reflect on current digital preservation trends, review Fedora’s history in support of long-term durability of digital content, and provide repository developers and managers with suggestions for effective digital preservation strategies.
Neil Beagrie has played a major role in digital preservation for several organizations including founding the Digital Preservation Coalition and overseeing development of the UK Digital Curation Centre: he was also the international consultant to the U.S. National Digital Information Infrastructure and Preservation ProgramNSDL Annotation (NDIIPP). His career has spanned a range of senior information management roles at the British Library, the JISC, the Arts and Humanities Data Service, and the Royal Commission on the Historical Monuments of England.

By Sandy Payette, Executive Director, Fedora Commons

Welcome to the first issue of HatCheck, a quarterly newsletter published by Fedora Commons. Our goal is to make HatCheck a great place for news from our growing community, progress reports on our open-source software development, special features stories, and upcoming events. Kudos to Jim Blake for the double entendre embodied in our newsletter name!

For both newcomers and long-term users of our open source software, it’s always important to reflect on the significance of our “hat” origins. The Flexible, Extensible Digital Object Repository Architecture (Fedora) was designed at Cornell University in the late 1990s in anticipation of the need for a naturally evolving system (i.e., flexible and extensible) for managing and disseminating rich and heterogeneous digital content. From 2001-2007, the Fedora Project (Cornell and University of Virginia) embraced this design to create the internationally recognized open-source Fedora repository software that can be found at the core of many innovative systems including digital libraries, scientific and scholarly e-research platforms, digital archives, and more. From the Web perspective, the Fedora repository software is notable for its ability to provide a robust repository-based underpinning for web applications, including traditional Web apps, Web 2.0-style apps, and increasing Semantic Web-style apps.

In May 2007, with support from the Gordon and Betty Moore Foundation and the Andrew W. Mellon Foundation, we established Fedora Commons as an independent non-profit organization. Fedora Commons now serves as the home base for continued innovative software development and new community organizing activities. Fedora Commons promotes the use of open-source software to support management and sharing of our most important digital assets, especially digital content that forms the record of our shared intellectual, scientific, and cultural knowledge. We address fundamental questions pertaining to access and management of digital content such as whether our information is enduring and re-usable.

We now invite you to contemplate our Fedora Commons motivations, and to join us in the process of expanding and enriching our software and our community to meet new and exciting challenges in the years to come.

To build our Fedora Commons community, we have created new Outreach and Communication functions under the leadership of Thorny Staples and Carol Minton Morris. The HatCheck newsletter is just one of many mechanisms we are working on to enable information exchange and collaboration within the Fedora Commons community. The uptake of the Fedora open source software is clearly accelerating, with a wide variety of institutions reporting installations of Fedora. With the startup of Fedora Commons, we have begun a systematic analysis of our existing user community. To share this information, we have started a community registry on the Fedora Commons wiki which can be accessed here. We are looking forward to hearing from more users since we have observed over 20,000 downloads of our software in the last 12 months.

We encourage you to register your organization in our community registry, and contact us directly to discuss your interest in being part of our community. Also, consider inspiring others with your work by showcasing your system or project in upcoming issues of HatCheck, or on our website.

From the software development perspective, we are creating new processes and tools to enable community developers to engage and collaborate with the Fedora Commons core team. These efforts are under the leadership of Dan Davis and Chris Wilper. We are currently in the process of finalizing our software development roadmap which will be published this month on the Fedora Commons website. The roadmap will be a living document that will communicate our software development plans as they evolve. The roadmap is created with input from the community, including strategic partners, community councils, and feedback from users like you.

We are also pleased to report on several new community collaborations that will help us evolve the Fedora Commons software in key solution areas. Among these new and notable strategic collaborations are:

- Johns Hopkins University and Cornell University, to focus on data curation

- Topaz and the Public Library of Science, to focus on open access publishing

- Mulgara, to focus on highly scalable semantic technologies

- Sun Microsystems, to focus on robust storage for preservation and archiving

- OAI Object Reuse and Exchange, to focus on re-use and interoperability

- FIZ Karlsruhe, focusing on scalability and benchmarking

We look forward to these collaborations and many other community-based software development activities. Also, stay tuned on the Fedora users and developers email lists for announcements about the roadmap, new collaboration tools, and improved documentation.

Links mentioned in this piece:

Web Site: http://www.fedora-commons.org

Our Mission: http://www.fedora-commons.org/about/

Community Registry: http://fedora.info/wiki/index.php/Fedora_Commons_Community_Registry

Email Lists: http://sourceforge.net/mail/?group_id=177054