From tags to topic maps : using marked-up Hebrew text to discover linguistic patterns

dc.contributor.authorKroeze, J.H. (Jan Hendrik)
dc.contributor.emailjan.kroeze@up.ac.zaen
dc.contributor.upauthorBothma, T.J.D. (Theodorus Jan Daniel)
dc.contributor.upauthorMatthee, Machdel C.
dc.date.accessioned2008-06-04T07:45:26Z
dc.date.available2008-06-04T07:45:26Z
dc.date.issued2008-05-18
dc.description.abstractThe paper discusses a series of related techniques that prepare and transform raw linguistic data for advanced processing in order to unveil hidden grammatical patterns. It identifies XML as a suitable mark-up language to build an exploitable data bank of multi-dimensional data in the Hebrew text of the Old Testament. This concept is illustrated by tagging a transcription of Gen. 1:1-2:3 and manipulating this data bank. Transferring the data into a three-dimensional array allows advanced processing of the data in order to either confirm existing knowledge or to mine for new, yet undiscovered, linguistic features. Visualisation is discussed as a technique that enhances interaction between the human researcher and the computerised technologies supporting this process of knowledge creation. The empirical study is a small experiment that illustrates the viability and usefulness of the proposed expert devices as well as the benefits of applying information system techniques to linguistic databases.en
dc.format.extent351636 bytes
dc.format.mimetypeapplication/pdf
dc.identifier.citationKroeze, JH ,Bothma, TJD, & Matthee, MC 2008, ' From tags to topic maps: using marked-up Hebrew text to discover linguistic patterns',Proceedings of the 2008 International Conference on Information Resources Management (Conf-IRM 2008),[http://www.sprott.carleton.ca/conf-irm/CFP2008.pdf]en
dc.identifier.isbn978-0-473-134455-7
dc.identifier.urihttp://hdl.handle.net/2263/5778
dc.language.isoenen
dc.publisherProceedings of the 2008 International Conference on Information Resources Managementen
dc.rightsProceedings of the 2008 International Conference on Information Resources Management (Conf-IRM 2008) Niagara Falls, Ontario, Canada, 18-20 May 2008en
dc.subjectText data miningen
dc.subjectData warehousingen
dc.subjectMOLAPen
dc.subjectXMLen
dc.subjectGenesisen
dc.subject.lcshHebrew language -- Data processing
dc.subject.lcshData mining
dc.subject.lcshData warehousing
dc.subject.lcshXML (Document markup language)en
dc.titleFrom tags to topic maps : using marked-up Hebrew text to discover linguistic patternsen
dc.typeArticleen

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Kroeze_TopicMaps (2008).pdf
Size:
343.39 KB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
2.39 KB
Format:
Item-specific license agreed upon to submission
Description: