Project "Automatic enrichment of OAI metadata" at Bielefeld



*sorry for crossposting*

Dear Colleagues,

We would like to announce a project that is currently being
carried out at Bielefeld University Library:

"Automatic enrichment of OAI metadata by means of computational 
linguistics methodology and development of services for
content-based integration of repositories"

Project description:

Having access to high-quality scientific information is an 
important prerequisite of scholarship. The increasing 
availability of electronic publications in content-stores 
(repositories) distributed over the internet and their 
aggregation within the framework of the Open Archives Initiative 
(OAI) already contribute substantially to this. The project
"Automatic enrichment of OAI metadata by means of computational
linguistics methodology and development of services for
content-based integration of repositories", funded by the German
Research Foundation (Deutsche Forschungsgemeinschaft, DFG), aims
to improve the subject classification of these scientific
documents using OAI metadata. The project is a cooperation
between Bielefeld University Library, the Department of 
Computational Linguistics at Bielefeld University (Professor 
Alexander Mehler), and the Department of Computer Science at the 
University of Leipzig (Professor Gerhard Heyer).

Within the scope of the project, documents lacking sufficient
classificatory information are to be classified automatically
by means of computational linguistics methodology. Different
classification schemes will be used, starting with the Dewey
Decimal Classification (DDC). The classification information
will be integrated into the metadata and can then be used in
different contexts: for example, it can be returned to the
repositories or included in scientific search engines like the
Bielefeld Academic Search Engine (BASE). The data will be made
available to other organizations for further re-use. Ultimately,
the normalized data should facilitate the semantic integration
of distributed repositories. Semantic browsing and search will
become feasible and improve the quality of electronic literature
search and retrieval.

The project combines the fields of digital libraries and text
technology (computational linguistics): the University Library
provides access to high-quality document collections through a
standardized interface, whereas text technology delivers the
linguistically based classification results.

Project management:  Dr. Wolfram Horstmann, CIO Scientific
Information at Bielefeld University

Contact and further information:  Mathias Loesch
mathias.loesch@uni-bielefeld.de

Project funding:  Deutsche Forschungsgemeinschaft
Duration: 2 years (starting October 2009)

If you have any questions or comments, please do not hesitate to 
contact me.

Kind regards,
Mathias Loesch

-- 
Mathias Loesch
Bielefeld University Library