ABSTRACT
The complexity and diversity of government regulations make understanding the regulations a non-trivial task. One of the issues is the existence of multiple sources of regulations and interpretive guides; the latter are often independent of governing bodies. This work aims to develop an information infrastructure for legal information retrieval with applications to electronic-rulemaking. The pilot study focuses on accessibility regulations from the US Federal government, private organizations and European agencies. A shallow parser is developed to consolidate different regulations into a unified XML format, which is well suited for handling semi-structured data such as legal documents. Handcrafted rules and a text mining tool are developed to extract the important features, such as concepts, measurements, effective dates and so on, and to incorporate them into the corpus.To compare and locate related provisions from different regulatory documents, we employ Information Retrieval techniques to combine generic features with domain knowledge. Structural information from regulations, such as the hierarchical organization of provisions and heavy referencing among provisions, are used to help improve the relatedness analysis. Results are obtained to illustrate the use of regulatory structure and domain knowledge in provision comparisons. Application to an e-rulemaking scenario for a rights-of-way drafted regulation is shown to demonstrate extended capabilities of the prototype system.
- Al-Kofahi, K., Tyrrell, A., Vachher, A., and Jackson, P. A Machine Learning Approach to Prior Case Retrieval. In Proceedings of the 8th International Conference on Artificial Intelligence and Law (ICAIL 2001) (St. Louis, Missouri, 2001), 2001, 88--93. Google ScholarDigital Library
- Americans with Disabilities Act (ADA) Accessibility Guidelines for Buildings and Facilities. US Architectural and Transportation Barriers Compliance Board (Access Board), Washington, DC, 1999.Google Scholar
- Baeza-Yates, R., and Ribeiro-Neto, B. Modern Information Retrieval. ACM Press, New York, NY, 1999. Google ScholarDigital Library
- Bellman, R. E. Adaptive Control Processes. Princeton University Press, Princeton, NJ, 1961.Google Scholar
- Bench-Capon, T. J. M. Knowledge Based Systems and Legal Applications. Academic Press Professional, Inc., San Diego, CA, 1991. Google ScholarDigital Library
- Bishop, C. M. Neural Networks for Pattern Recognition. Oxford University Press; Clarendon Press, New York, NY, 1995. Google ScholarDigital Library
- Boer, A., Hoekstra, R., and Winkels, R. METALex: Legislation in XML. In Proceedings of Jurix 2002: 15th Annual International Conference on Legal Knowledge and Information Systems (London, UK, 2002). IOS Press, 2002, 1--10.Google Scholar
- Bolioli, A., Dini, L., Mercatali, P., and Romano, F. For the Automated Mark-Up of Italian Legislative Texts in XML. In Proceedings of Jurix 2002: 15th Annual International Conference on Legal Knowledge and Information Systems (London, UK, 2002). ISO Press, 2002, 21--30.Google Scholar
- Bollacker, K. D., Lawrence, S., and Giles, C. L. CiteSeer: An Autonomous Web Agent for Automatic Retrieval and Identification of Interesting Publications. In Proceedings of the 2nd International Conference on Autonomous Agents (Minneapolis, MN, 1998). ACM Press, 1998, 116--123. Google ScholarDigital Library
- Branting, L. K. Building Explanations from Rules and Structured Cases. International Journal of Man-Machine Studies, 34, 6 (1991), 797--837. Google ScholarDigital Library
- Branting, L. K. Reasoning with Portions of Precedents. In Proceedings of the 3rd International Conference on Artificial Intelligence and Law (ICAIL 1991) (Oxford, England, 1991). ACM Press, 1991, 145--154. Google ScholarDigital Library
- Brin, S., and Page, L. The Anatomy of a Large-Scale Hypertextual Web Search Engine. In Proceedings of the 7th International World Wide Web Conference (Brisbane, Australia, 1998), 1998, 107--117. Google ScholarDigital Library
- British Standard 8300. British Standards Institution (BSI), London, UK, 2001.Google Scholar
- Brüninghaus, S., and Ashley, K. D. Improving the Representation of Legal Case Texts with Information Extraction Methods. In Proceedings of the 8th International Conference on Artificial Intelligence and Law (ICAIL 2001) (St. Louis, Missouri, 2001), 2001, 42--51. Google ScholarDigital Library
- Calado, P., Ribeiro-Neto, B., Ziviani, N., Moura, E., and Silva, I. Local versus Global Link Information in the Web. ACM Transactions on Information Systems (TOIS), 21, 1 (2003), 42--63. Google ScholarDigital Library
- California Building Code (CBC). California Building Standards Commission, Sacramento, CA, 1998.Google Scholar
- Daniels, J. J., and Rissland, E. L. What You Saw Is What You Want: Using Cases to Seed Information Retrieval. In Proceedings of the 2nd International Conference on Case-Based Reasoning (ICCBR-97) (Providence, RI, 1997), 1997, 325--336. Google ScholarDigital Library
- Deerwester, S., Dumais, S. T., Furnas, G. W., Landauer, T. K., and Harshman, R. Indexing by Latent Semantic Analysis. Journal of the American Society of Information Science, 41, 6 (1990), 391--407.Google ScholarCross Ref
- Dörre, J., Gerstl, P., and Seiffert, R. Text Mining: Finding Nuggets in Mountains of Textual Data. In Proceedings of ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (San Diego, CA, 1999). ACM Press, 1999, 398--401. Google ScholarDigital Library
- Draft Guidelines for Accessible Public Rights-of-Way. US Architectural and Transportation Barriers Compliance Board (Access Board), Washington, DC, 2002.Google Scholar
- Engers, T. M. v., and Vanlerberghe, R. A. W. The POWER-Light Version: Improving Legal Quality under Time Pressure. In Proceedings of EGOV 2002: the 1st International Conference on Electronic Government (Aixen-Provence, France, 2002), 2002, 75--83. Google ScholarDigital Library
- Garfield, E. New International Professional Society Signals the Maturing of Scientometrics and Informetrics. The Scientist, 9, 16 (1995).Google Scholar
- Gentner, D., and Markman, A. B. Structure Mapping in Analogy and Similarity. American Psychologist, 52, 1 (1997), 45--56.Google Scholar
- Gibbens, M. P. CalDAG 2000: California Disabled Accessibility Guidebook. Builder's Book, Canoga Park, CA, 2000.Google Scholar
- Golub, G. H., and Van Loan, C. F. Matrix Computations. The Johns Hopkins University Press, Baltimore, MD, 1983.Google Scholar
- Gurrin, C., and Smeaton, A. F. A Connectivity Analysis Approach to Increasing Precision in Retrieval from Hyperlinked Documents. In Proceedings of Text REtrieval Conference (TREC) (Gaithersburg, MD, 1999), 1999.Google Scholar
- International Building Code 2000. International Conference of Building Officials (ICBO), Whittier, CA, 2000.Google Scholar
- Kleinberg, J. Authoritative Sources in a Hyperlinked Environment. In Proceedings of the 9th ACM-SIAM Symposium on Discrete Algorithms (San Francisco, CA, 1998), 1998, 668--677. Google ScholarDigital Library
- Lau, G. A Comparative Analysis Framework for Semi-Structured Documents, with Applications to Government Regulations. Ph.D. Thesis, Stanford University, Stanford, CA, 2004. Google ScholarDigital Library
- Lau, G., Law, K., and Wiederhold, G. Similarity Analysis on Government Regulations. In Proceedings of the 9th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (Washington, DC, 2003). ACM Press, 2003, 111--117. Google ScholarDigital Library
- McLaren, B. M. Extensionally Defining Principles and Cases in Ethics: an AI Model. Artificial Intelligence, 150, 1--2 (2003), 145--181. Google ScholarDigital Library
- Miller, G. A., Beckwith, R., Fellbaun, C., Gross, D., and Miller, K. Five Papers on WordNet. Technical Report, Cognitive Science Laboratory, Princeton, NJ, 1993.Google Scholar
- Moens, M.-F., Uyttendaele, C., and Dumortier, J. Abstracting of Legal Cases: The SALOMON Experience. In Proceedings of the 6th International Conference on Artificial Intelligence and Law (Melbourne, Australia, 1997), 1997, 114--122. Google ScholarDigital Library
- Osborn, J., and Sterling, L. JUSTICE: A Judicial Search Tool Using Intelligent Concept Extraction. In Proceedings of the 7th International Conference on Artificial Intelligence and Law (ICAIL 1999) (Oslo, Norway, 1999), 1999, 173--181. Google ScholarDigital Library
- Page, L., Brin, S., Motwani, R., and Winograd, T. The PageRank Citation Ranking: Bringing Order to the Web. Technical Report, Stanford University, Stanford, CA, 1998.Google Scholar
- Proceedings of Business Compliance One Stop Workshop (Small Business Administration, Queenstown, MD, 2002), 2002.Google Scholar
- Rissland, E. L. Dimension-Based Analysis of Hypotheticals from Supreme Court Oral Argument. In Proceedings of the 2nd International Conference on Artificial Intelligence and Law (ICAIL 1989) (Vancouver, Canada, 1989). ACM Press, 1989, 111--120. Google ScholarDigital Library
- Rissland, E. L., Ashley, K. D., and Loui, R. P. AI and Law: A Fruitful Synergy. Artificial Intelligence, 150, 1--2 (2003), 1--15. Google ScholarDigital Library
- Rissland, E. L., and Skalak, D. B. CABARET: Rule Interpretation in a Hybrid Architecture. International Journal of Man-Machine Studies, 34, 6 (1991), 839--887. Google ScholarDigital Library
- Rissland, E. L., Skalak, D. B., and Friedman, M. T. BankXX: A Program to Generate Argument Through Case-Base Research. In Proceedings of the 4th International Conference on Artificial Intelligence and Law (ICAIL 1993) (Amsterdam, The Netherlands, 1993). ACM Press, 1993, 117--124. Google ScholarDigital Library
- Salton, G. The Smart Retrieval System - Experiments in Automatic Document Processing. Prentice Hall, Englewood Cliffs, NJ, 1971. Google ScholarDigital Library
- Salton, G., and Buckley, C. Term-Weighting Approaches in Automatic Retrieval. Information Processing and Management, 24, 5 (1988), 513--523. Google ScholarDigital Library
- Salton, G., and McGill, M. Introduction to Modern Information Retrieval. McGraw-Hill, New York, NY, 1983. Google ScholarDigital Library
- Schweighofer, E., Rauber, A., and Dittenbach, M. Automatic Text Representation, Classification and Labeling in European Law. In Proceedings of the 8th International Conference on Artificial Intelligence and Law (ICAIL 2001) (St. Louis, Missouri, 2001), 2001, 78--87. Google ScholarDigital Library
- Semio Tagger. Semio Corporation, 2002. http://www.semio.com.Google Scholar
- Sergot, M. J., Sadri, F., Kowalski, R. A., Kriwaczek, F., Hammond, P., and Cory, H. T. The British Nationality Act as a Logic Program. Communications of the ACM, 29, 5 (1986), 370--386. Google ScholarDigital Library
- Shepard's Federal Citations. Shepards/Mcgraw-Hill, Colorado Springs, CO, 1990.Google Scholar
- Silva, I., Ribeiro-Neto, B., Calado, P., Moura, E., and Ziviani, N. Link-Based and Content-Based Evidential Information in a Belief Network Model. In Proceedings of the 23rd Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (Athens, Greece, 2000), 2000, 96--103. Google ScholarDigital Library
- Technical Standards. Scottish Executive, Edinburgh, Scotland, UK, 2001.Google Scholar
- Thompson, P. Automatic Categorization of Case Law. In Proceedings of the 8th International Conference on Artificial Intelligence and Law (ICAIL 2001) (St. Louis, Missouri, 2001), 2001, 70--77. Google ScholarDigital Library
- Uniform Federal Accessibility Standards (UFAS). US Architectural and Transportation Barriers Compliance Board (Access Board), Washington, DC, 1997.Google Scholar
- Zeleznikow, J., and Hunter, D. Building Intelligent Legal Information Systems. Kluwer Law and Taxation Publishers, Deventer, The Netherlands, 1994.Google Scholar
Index Terms
- Legal information retrieval and application to e-rulemaking
Recommendations
A relatedness analysis approach for regulation comparison and e-rulemaking applications
dg.o '05: Proceedings of the 2005 national conference on Digital government researchThe process of e-rulemaking with participation from the public involves a non-trivial task of sorting through and organizing a massive volume of electronically submitted comments. This research proposes to make use of available Information and ...
The Deliberative E-Rulemaking project (DeER): improving federal agency rulemaking via natural language processing and citizen dialogue
dg.o '08: Proceedings of the 2008 international conference on Digital government researchMany scholars believe that electronic rulemaking has great but largely untapped potential to expand the public's democratic input and improve federal agency regulatory rules. The existing federal rulemaking process, however, elicits many redundant and ...
REGNET: regulatory information management, compliance and analysis
ICEGOV '12: Proceedings of the 6th International Conference on Theory and Practice of Electronic GovernanceThis paper describes a research effort that aims to develop information infrastructure and tools to facilitate access, compliance and analysis of government regulations. It is well recognized that the complexity, diversity, and volume of government ...
Comments