Selected Document Management Publications
1999
- J. Jaakkola, P. Kilpeläinen, G. Lindén, J. Niemi, and K. Paasiala:
TranSID: an SGML document manipulation language -
Reference manual.
Department of Computer Science
Report C-1999-35,
University of Helsinki, June 1999.
- J. Jaakkola and P. Kilpeläinen:
Nested Text-Region Algebra.
Department of Computer Science
Report C-1999-2,
University of Helsinki, January 1999.
- P. Kilpeläinen:
SGML & XML content models.
Markup Languages: Theory & Practise,
Vol. 1, No. 2 (Spring 1999), 53-76.
(A preliminary version available as
Report C-1998-12,
University of Helsinki, Department of Computer
Science, May 1998.)
1998
- Helena Ahonen, Barbara Heikkinen, Oskari Heinonen, Jani Jaakkola,
Pekka Kilpeläinen, and Greger Lindén.
Design and implementation of a document assembly workbench.
In Electronic Publishing, Artistic Imaging and Digital Typography,
Roger D. Hersch, Jacques André, and Heather Brown (eds.),
Proceedings of the
EP'98 and RIDT'98 Conferences,
St Malo, France, March 30 - April 3, 1998.
Lecture Notes in Computer
Science Series, Vol. 1375,
Springer-Verlag: Heidelberg, 1998, 476-486.
- Helena Ahonen, Barbara Heikkinen, Oskari Heinonen, Jani Jaakkola,
Mika Klemettinen.
Analysis of Document Structures for Element Type Classification.
To appear in the Proceedings of the 4th International
Workshop on Principles of Digital Document Processing, PODDP '98,
St Malo, France, March 1998.
- Helena Ahonen, Oskari Heinonen, Mika Klemettinen, A. Inkeri Verkamo.
Applying Data Mining Techniques for Decriptive Phrase Extraction in
Digital Document Collections.
In Proceedings of the IEEE Forum on Research and Technology
Advances in Digital Libraries (ADL'98),
Santa Barbara, Ca, USA, April 22-24, 1998,
pages 2-11, IEEE Computer Society, Los Alamitos 1998.
- Helena Ahonen. Features of Knowledge Discovery Systems.
InterCHANGE, The Newsletter of the International SGML Users'
Group, 4(2), April 1998, 15-16.
- Pekka Kilpeläinen.
Kirjojen sisältöinformaation hyödyntäminen.
(The utilization of the content information of books, in Finnish.)
Chapter 4 in Oppikirjan digitaalitulevaisuus,
final report of the SÄÄTÖ II -project.
The Finnish National Board of Education, 1998.
- Oskari Heinonen.
Optimal Multi-Paragraph Text Segmentation by Dynamic Programming.
To appear in Proceedings of the COLING-ACL '98 Conference.
- Pekka Kilpeläinen.
SGML & XML content models.
Department of Computer Science
Report C-1998-12,
University of Helsinki, May 1998.
An extended version submitted for publication in
Markup Languages: Theory & Practise.
1997
- Helena Ahonen, Barbara Heikkinen, Oskari Heinonen, and Pekka
Kilpeläinen. A System for Assembling Specialized Textbooks from a Pool of
Documents. Technical Report C-1997-22, University of Helsinki,
Department of Computer Science, March 1997.
- Helena Ahonen, Oskari Heinonen, Mika Klemettinen, and
A. Inkeri Verkamo. Applying
Data Mining Techniques in Text Analysis. Technical Report C-1997-23,
University of Helsinki, Department of Computer Science, March 1997.
- Helena Ahonen, Barbara Heikkinen, Oskari Heinonen, and Mika
Klemettinen. Improving the accessibility of SGML documents - A content-analytical approach.
To appear in SGML Europe '97, Barcelona, Spain, May 1997. GCA.
- Greger Lindén. Structured Document
Transformations. PhD Thesis,
Report A-1997-2, Department of Computer Science, University
of Helsinki, June 1997. 122 pages.
- Jani Jaakkola, Pekka Kilpeläinen, and Greger Lindén.
TranSID: An SGML Tree Transformation Language. Technical Report
C-1997-36, University
of Helsinki, Department of Computer Science, May 1997. Also published in
the Proceedings of The Fifth Symposium on Programming Languages and Software Tools,
Jyväskylä, Finland, June 7-8, 1997, ed. Jukka Paakki, pages 72-83,
Technical Report C-1997-37, University
of Helsinki, Department of Computer Science, June 1997.
- Helena Ahonen, Oskari Heinonen, Mika Klemettinen, and
A. Inkeri Verkamo. Mining
in the Phrasal Frontier. Technical Report C-1997-14, University
of Helsinki, Department of Computer Science, February 1997. A
revised version to appear in Proceedings of PKDD'97 - 1st European
Symposium on Principles of Data Mining and Knowledge Discovery,
Trondheim, Norway, June 1997.
-
Helena Ahonen, Barbara Heikkinen, Oskari Heinonen, and Pekka
Kilpeläinen.
Assembling documents from digital libraries.
In Database and Expert Systems Applications,
Proceedings of the
8th International Conference, DEXA '97, A. Hameurlain and A.M. Tjoa
(eds.),
Springer Lecture Notes in Computer Science 1308,
419-429.
- Lasse Akselin and Pekka Kilpeläinen.
Tekstialuekyselyiden graafinen esittäminen.
(Graphical text region queries, in Finnish.)
In: SGML Finland 1997 -seminaarijulkaisu, K. Rytkönen (ed.),
Vaasa, Finland, October 1997. SGML User Group Finland, 1997.
109-122.
- Helena Ahonen, Barbara Heikkinen, Oskari Heinonen, Mika Klemettinen.
Discovery of reasonably-sized fragments using inter-paragraph
similarities. Department of Computer Science Report C-1997-67.
University of Helsinki, November 1997.
1996
- Helena Ahonen, Barbara Heikkinen, Oskari Heinonen, Jani Jaakkola,
Pekka Kilpeläinen, Greger Lindén, and Heikki Mannila. Intelligent Assembly
of Structured Documents. Technical Report C-1996-40, University of
Helsinki, Department of Computer Science, June 1996.
- Helena Ahonen. Automatic
generation of SGML content models. In Electronic Publishing
'96, Palo Alto, California, USA, September 1996.
- Helena Ahonen. Disambiguation
of SGML content models. In Proceedings of the PODP'96 Workshop
on the Principles of Document Processing, Palo Alto, California,
USA, September 1996.
- Pekka Kilpeläinen and Derick Wood. SGML and exceptions.
In Proceedings of the PODP'96 Workshop on the Principles of
Document Processing, Palo Alto, California, USA, September 1996.
- Helena Ahonen, Barbara Heikkinen, Oskari Heinonen, Jani Jaakkola,
Pekka Kilpeläinen, Greger Lindén, and Heikki Mannila. Constructing tailored SGML documents. In
J. Saarela, editor, Proceedings of SGML Finland
1996, Espoo, Finland, October 1996. SGML User Group Finland,
pages 106-116.
- Jani Jaakkola and Pekka Kilpeläinen. Using sgrep for querying
structured text files.
In J. Saarela, editor, Proceedings of SGML Finland
1996, Espoo, Finland, October 1996. SGML User Group Finland,
pages 56-67. Available as a short
abstract and as Technical Report C-1996-83.
- Helena Ahonen. Generating Grammars for Structured Documents Using
Grammatical Inference Methods. Ph.D. Thesis, University of Helsinki,
Department of Computer Science, Series of Publications A,
Report A-1996-4, November 1996.
- Greger Lindén, Henry Tirri and A. Inkeri Verkamo. ALCHEMIST: A
General Purpose Transformation Generator. To appear in Software -
Practice and Experience, 1996. See Technical
Report C-1995-43 for an extended version of this paper.
1995
- Pekka Kilpeläinen and Heikki Mannila. Ordered and unordered tree
inclusion. SIAM Journal on Computing, Vol. 24, No. 2, April 1995,
340-356.
- Greger Lindén, Henry Tirri. ALCHEMIST - the handbook, version 1.08.
Deliverable UH/T416/2, ESPRIT-II Project P5365 VITAL, April 1995.
- G.E. Blake, M.P. Consens, I.J. Davis, P. Kilpeläinen, E.Kuikka,
P.-Å. Larson, T. Snider and F.W. Tompa. Text/relational
database management systems: Overview and proposed SQL extensions.
Report CS-95-25, UW Centre for the New OED and Text Research, Department
of Computer Science, University of Waterloo, June 1995.
- J. Jaakkola and P. Kilpeläinen. Sgrep - A Tool to Search
Structured Text. Manuscript in preparation. University of Helsinki,
August 1995.
- Greger Lindén, Henry Tirri and A. Inkeri Verkamo. ALCHEMIST:
A General Purpose Transformation Generator. Technical Report
C-1995-43, Department of Computer Science, University of Helsinki,
September 1995.
1994
- Helena Ahonen. Generating grammars for structured documents using
grammatical inference methods. Ph.Lic. Thesis. Report C-1994-65,
Department of Computer Science, University of Helsinki, 1994.
- Helena Ahonen, Heikki Mannila, and Erja Nikunen. Forming grammars
for structured documents: an application of grammatical inference. In
R. Carrasco and J. Oncina, editors, Proceedings of the Second
International Colloquium on Grammatical Inference and Applications,
Lecture Notes in Artificial Intelligence 862, pages 153-167.
Springer-Verlag, 1994.
- Helena Ahonen, Heikki Mannila, and Erja Nikunen. Generating
grammars for SGML tagged texts lacking DTD. In M. Murata and H.
Gallaire, editors, Proceedings of the Workshop on Principles of
Document Processing '94, Darmstadt, 1994. To be published also in
Computer and Mathematical Modelling.
- G.E. Blake, M.P. Consens, P. Kilpeläinen, P.-Å. Larson, T.
Snider, and F.W. Tompa. Text/relational database management systems:
Harmonizing SQL and SGML. In W. Litwin and T. Risch, editors,
Applications of Databases, Proceedings of the First International
Conference, ADB-94, pages 267-280, Springer-Verlag, 1994.
- Pekka Kilpeläinen and Heikki Mannila. Query primitives for
tree-structured data. In M. Crochemore and D. Gusfield, editors,
Proceedings of the 1994 Symposium on Combinatorial Pattern
Matching, pages 213-225, Springer-Verlag, 1994.
- P. Kilpeläinen and D. Wood. Exceptions in SGML document grammars.
Manuscript. University of Waterloo, August 1994.
- Henry Tirri and Greger Lindén. ALCHEMIST
- an object-oriented tool to build transformations between Heterogeneous
Data Representations. In Hesham-El-Rewini and Brice D. Shriver,
editors, Proceedings of the Twenty-Seventh Annual Hawaii International
Conference on System Sciences, volume II, pages 226-235, Maui, Hawaii,
January 1994. IEEE Computer Society Press.
1993
- Helena Ahonen, Heikki Mannila, and Erja Nikunen. Forming grammars for
structured documents. Proceedings of the 1993 Workshop on Knowledge
Discovery in Databases, Washington, D.C., July 1993.
- Helena Ahonen, Heikki Mannila and Erja Nikunen. Interactive forming
of grammars. Technical Report C-1993-17, Department of Computer Science,
University of Helsinki, 1993.
- Pekka Kilpeläinen, and Heikki Mannila. Retrieval from hierarchical
documents using partial patterns. In R. Korfhage,
E. Rasmussen, and P. Willett, editors, ACM SIGIR '93: Proceedings of
the 16th Annual International Conference on Research and Development in
Information Retrieval, Pittsburgh, PA, USA, June 1993, pages 214-222.
- Greger Lindén. Incremental
updates in structured documents. Ph.Lic. Thesis, Report
C-1993-19, Department of Computer Science, University of Helsinki, April
1993.
1992
- Pekka Kilpeläinen. Tree
Matching Problems with Applications to Structured Text databases.
Ph.D. Thesis, Report A-1992-6, Department of Computer Science, University
of Helsinki, 1992.
- Pekka Kilpeläinen and Heikki Mannila. Grammatical tree matching.
In A. Apostolico et al., editors, Combinatorial Pattern
Matching, Third Workshop, Tucson, Arizona, April 1992.
Springer-Verlag, pages 202-214.
1991
- Pekka Kilpeläinen and Heikki Mannila. The tree inclusion problem.
In S. Abramsky and T.S.E. Maibaum, editors, TAPSOFT '91,
Brighton, England, April 1991, pages 202-214.
- Pekka Kilpeläinen and Heikki Mannila. Ordered and unordered tree
inclusion. Report A-1991-4, University of Helsinki, Department of
Computer Science, August 1991.
1990
- Pekka Kilpeläinen, Greger Lindén, Heikki Mannila, and Erja Nikunen.
A
structured text database system. In Richard Furuta, editor, EP90
- Proceedings of the International Conference on Electronic Publishing,
Document Manipulation & Typography, Gaithersburg,
Maryland, September 1990. The Cambridge Series on Electronic
Publishing, Cambridge University Press, pages 139-151.
DocMan Group.
Last updated August 19, 1998.