Author:
Chung Philip,Mowbray Andrew,Greenleaf Graham
Abstract
AbstractIn this article Philip Chung, Andrew Mowbray, and Graham Greenleaf, the Co-Directors of the Australasian Legal Information Institute (AustLII), explain the need for an open source search engine which can search simultaneously over legal materials in European languages and also in Asian languages, particularly those that require a ‘double byte’ representation, and the difficulties this task presents. A solution is proposed; the ‘u16a’ modifications to AustLII's open source search engine (Sino) which is used by many legal information institutes. Two implementations of the Sino u16A approach, on the Hong Kong Legal Information Institute (HKLII), for English and Chinese, and on the Asian Legal Information Institute (AsianLII), for multiple Asian languages, are described. The implementations have been successful, though many challenges (discussed briefly) remain before this approach will provide a full multi-lingual search facility.
Publisher
Cambridge University Press (CUP)
Subject
General Agricultural and Biological Sciences
Reference29 articles.
1. Pun K . (2003) ‘Processing Legal Documents in the Chinese-Speaking World: the Experience of HKLII’ Proc. Law via Internet Conference, 2003
2. Nguyen TV , Tran HK , Nguyen TTT and Nguyen H . (2006) ‘Word Segmentation for Vietnamese Text Categorization: An online corpus approach’ in Proceedings of 4th IEEE International Conference on Computer Science – Research, Innovation and Vision of the Future 2006 (RIVF'06), p. 172–178
3. Investigating the relationship between word segmentation performance and retrieval performance in Chinese IR
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Current Awareness;Legal Information Management;2012-12