Call for participation: Towards Digital Mathematics Library (DML 2008)
July 27th, 2008, Birmingham, UK c/o MKM 2008
Workshop webpage: http://www.fi.muni.cz/~sojka/dml-2008.xhtml
Registration: http://events.cs.bham.ac.uk/cicm08/registration.php
Travel: http://events.cs.bham.ac.uk/cicm08/travel.php
Accomodation: http://events.cs.bham.ac.uk/cicm08/accommodation.php
Overview:
Mathematicians dream of a digital archive containing all peer-reviewed
mathematical literature ever published, properly linked and
validated/verified. It is estimated that the entire corpus of mathematical
knowledge published over the centuries does not exceed 100,000,000
pages, an amount easily manageable by current information technologies.
The workshop's objectives are to formulate the strategy and goals of
a global mathematical digital library and to summarize the current
successes and failures of ongoing technologies and related projects,
asking such questions as:
# What technologies, standards, algorithms and formats should be used
and what metadata should be shared?
# What business models are suitable for publishers of mathematical
literature, authors and funders of their projects and institutions?
# Is there a model of sustainable, interoperable, and extensible
mathematical library that mathematicians can use in their everyday work?
# What is the best practice for
* retrodigitized mathematics (from images via OCR to MathML and/or TeX);
* retro-born-digital mathematics (from existing electronic copy in
DVI, PS or PDF to MathML and/or TeX);
* born-digital mathematics (how to make needed metadata and file formats
available as a side effect of publishing workflow [CEDRAM model])?
Proceedings:
(200+ pages) was published by Masaryk University and will be available on site.
Papers and Posters selected for presentation at the workshop:
Part I Towards Digital Mathematics Library
* Some Thoughts on the Near-Future Digital Mathematics Library
Thierry Bouche (Université de Grenoble I, France)
* From Pixels and Minds to the Mathematical Knowledge in a Digital Library
Petr Sojka (Masaryk University, Brno, Czech Republic)
Jiří Rákosník (Institute of Mathematics AS CR, Praha, Czech Republic)
* Mathematical Document Classification via Symbol Frequency Analysis
Stephen M. Watt (University of Western Ontario, London Ontario, Canada)
Part II Towards Mathematical OCR and Search
* Lexical Error Compensation in Handwritten-based Mathematical Information Retrieval
Seyed Ali Ahmadi (George Washington University, United States)
Abdou Youssef (George Washington University, United States)
* Extending Full Text Search Engine for Mathematical Content
Jozef Mišutka (Charles University in Prague, Czech Republic)
Leo Galamboš (Charles University in Prague, Czech Republic)
* Mathematical Formulae Recognition
Daniel Průša (Czech Technical University, Prague, Czech Republic)
Václav Hlaváč (Czech Technical University, Prague, Czech Republic)
* Extracting Precise Data on the Mathematical Content of PDF Documents
Josef B. Baker (University of Birmingham, United Kingdom)
Alan P. Sexton (University of Birmingham, United Kingdom)
Volker Sorge (University of Birmingham, United Kingdom)
Part III Digitization Reports
* RusDML 2008
Bernd Wegner (Mathematisches Institut, TU Berlin, Germany)
* Digitization of Mathematical Editions in Serbia
Žarko Mijajlović (Faculty of Mathematics, Belgrade, Serbia)
Zoran Ognjanović (Mathematical Institute, Belgrade, Serbia)
* Current Status of Mathematical Publications in Japan
Takao Namiki (Hokkaido University, Sapporo, Japan)
* Small Scale Retrodigitization
Michael Doob (The University of Manitoba, Canada)
Part IV Digitization Technologies and Platforms
* Building the Czech Digital Mathematics Library upon DSpace System
Vlastimil Krejčíř (Masaryk University, Brno, Czech Republic)
* Migration of the Mathematical Collection of Polish Virtual Library of Science to the YADDA Platform
Katarzyna Zamlynska (University of Warsaw, Poland)
Lukasz Bolikowski (University of Warsaw, Poland)
Tomasz Rosiek (University of Warsaw, Poland)
* A Language Engineering Architecture for Processing Informal Mathematical Discourse
Magdalena Wolska (Saarland University, Germany)
Part V Digitization Tools
* DML-CZ Metadata Editor
Miroslav Bartošek (Masaryk University, Brno, Czech Republic)
Petr Kovář (Masaryk University, Brno, Czech Republic)
Martin Šárfy (Masaryk University, Brno, Czech Republic)
* CEDRICS: When CEDRAM Meets Tralics (Keynote)
Thierry Bouche (Université de Grenoble I & CNRS, France)
* Automated Processing of TEX-Typeset Articles for a Digital Library
Michal Růžička (Masaryk University, Brno, Czech Republic)
Workshop Overview:
Mathematicians dream of a digital archive containing all peer-reviewed
mathematical literature ever published, properly linked and
validated/verified. It is estimated that the entire corpus of mathematical
knowledge published over the centuries does not exceed 100,000,000
pages, an amount easily manageable by current information technologies.
The workshop's objectives are to formulate the strategy and goals of
a global mathematical digital library and to summarize the current
successes and failures of ongoing technologies and related projects,
asking such questions as:
# What technologies, standards, algorithms and formats should be used
and what metadata should be shared?
# What business models are suitable for publishers of mathematical
literature, authors and funders of their projects and institutions?
# Is there a model of sustainable, interoperable, and extensible
mathematical library that mathematicians can use in their everyday work?
# What is the best practice for
* retrodigitized mathematics (from images via OCR to MathML and/or TeX);
* retro-born-digital mathematics (from existing electronic copy in
DVI, PS or PDF to MathML and/or TeX);
* born-digital mathematics (how to make needed metadata and file formats
available as a side effect of publishing workflow [CEDRAM model])?
Topics:
(include, but are not limited to)
o search, indexing and retrieval of mathematical documents
o ranking of mathematical papers, similarity of mathematical documents
o math OCR with MathML/TeX output
o document conversions from/to MathML, OpenMath, LaTeX,
PostScript and [tagged] PDF
o conversions between various mathematical formalisms
o mathematical document compression
o processing of scanned images
o algorithms for crosslinking of bibliographical items,
intext citations search
o mathematical document classification, MSC 2010
o mathematical text mining
o mathematical documents metadata exchange via OAI-PMH and/or OAI-ORE
o long term archiving, data migration
o reports and experience from math digitization projects
o math publishing with long term archival goal
o software engineering aspects of creating, handling MathML,
OMDoc, OpenMath documents, and displaying them in web browsers
Programme Committee:
Jose Borbinha (Technical University of Lisbon, IST, PT)
Thierry Bouche (University Grenoble, Cellule Mathdoc, FR)
Thomas Fischer (Goettingen University, Digitization Center, DE)
Vaclav Hlavac (Czech Technical University, Faculty of Engineering, Prague, CZ)
Janka Chlebikova (Comenius University, MFF, Bratislava, SK)
Enrique Macias-Virgos (University of Santiago de Compostella, ES)
Jiri Rakosnik (Academy of Sciences, Mathematical Institute, Prague, CZ)
Eugenio Rocha (University of Aveiro, Dept. of Mathematics, PT)
David Ruddy (Cornell University, Library, US)
Petr Sojka (Masaryk University, Faculty of Informatics, Brno, CZ) [chair]
Volker Sorge (University of Birmingham, UK)
Masakazu Suzuki (Kyushu University, Faculty of Mathematics, JP)
Bernd Wegner (Technical University & Zentralblatt MATH, Berlin, DE)
Organizing Committee:
Adam Rambousek, Michal Ruzicka, Petr Sojka, Volker Sorge
Questions/inquiries:
mail to dml2008 at easychair dot org
CFP distribution:
Please, distribute at your place. And apologies for multiple postings!