An information extraction model for recommending the most applied case
- Authors: Padayachy, Thashen
- Date: 2019
- Subjects: Information technology , Information storage and retrieval systems System design
- Language: English
- Type: Thesis , Masters , MSc
- Identifier: http://hdl.handle.net/10948/43325 , vital:36794
- Description: The amount of information produced by different domains is constantly increasing. One domain that particularly produces large amounts of information is the legal domain, where information is mainly used for research purposes. However, too much time is spent by legal researchers on searching for useful information. Information is found by using special search engines or by consulting hard copies of legal literature. The main research question that this study addressed is “What techniques can be incorporated into a model that recommends the most applied case for a field of law?”. The Design Science Research (DSR) methodology was used to address the research objectives. The model developed is the theoretical contribution produced from following the DSR methodology. A case study organisation, called LexisNexis, was to help investigate the real-world problem. The initial investigation into the real-world problem revealed that too much time is spent on searching for the Most Applied Case (MAC) and no formal or automated processes were used. An analysis of an informal process followed by legal researchers enabled the identification of different concepts that could be combined to create a prescriptive model to recommend the MAC. A critical analysis of the literature was conducted to obtain a better understanding of the legal domain and the techniques that can be applied to assist with problems faced in this domain, related to information retrieval and extraction. This resulted in the creation of an IE Model based only on theory. Questionnaires were sent to experts to obtain a further understanding of the legal domain, highlight problems faced, and identify which attributes of a legal case can be used to help recommend the MAC. During the Design and Development activity of the DSR methodology, a prescriptive MAC Model for recommending the MAC was created based on findings from the literature review and questionnaires. The MAC Model consists of processes concerning: Information retrieval (IR); Information extraction (IE); Information storage; and Query-independent ranking. Analysis of IR and IE helped to identify problems experienced when processing text. Furthermore, appropriate techniques and algorithms were identified that can process legal documents and extract specific facts. The extracted facts were then further processed to allow for storage and processing by query-independent ranking algorithms. The processes incorporated into the model were then used to create a proof-of-concept prototype called the IE Prototype. The IE Prototype implements two processes called the IE process and the Database process. The IE process analyses different sections of a legal case to extract specific facts. The Database process then ensures that the extracted facts are stored in a document database for future querying purposes. The IE Prototype was evaluated using the technical risk and efficacy strategy from the Framework for Evaluation of Design Science. Both formative and summative evaluations were conducted. Formative evaluations were conducted to identify functional issues of the prototype whilst summative evaluations made use of real-world legal cases to test the prototype. Multiple experiments were conducted on legal cases, known as source cases, that resulted in facts from the source cases being extracted. For the purpose of the experiments, the term “source case” was used to distinguish between a legal case in its entirety and a legal case’s list of cases referred to. Two types of NoSQL databases were investigated for implementation namely, a graph database and a document database. Setting up the graph database required little time. However, development issues prevented the graph database from being successfully implemented in the proof-of-concept prototype. A document database was successfully implemented as an alternative for the proof-of-concept prototype. Analysis of the source cases used to evaluate the IE Prototype revealed that 96% of the source cases were categorised as being partially extracted. The results also revealed that the IE Prototype was capable of processing large amounts of source cases at a given time.
- Full Text:
- Date Issued: 2019
- Authors: Padayachy, Thashen
- Date: 2019
- Subjects: Information technology , Information storage and retrieval systems System design
- Language: English
- Type: Thesis , Masters , MSc
- Identifier: http://hdl.handle.net/10948/43325 , vital:36794
- Description: The amount of information produced by different domains is constantly increasing. One domain that particularly produces large amounts of information is the legal domain, where information is mainly used for research purposes. However, too much time is spent by legal researchers on searching for useful information. Information is found by using special search engines or by consulting hard copies of legal literature. The main research question that this study addressed is “What techniques can be incorporated into a model that recommends the most applied case for a field of law?”. The Design Science Research (DSR) methodology was used to address the research objectives. The model developed is the theoretical contribution produced from following the DSR methodology. A case study organisation, called LexisNexis, was to help investigate the real-world problem. The initial investigation into the real-world problem revealed that too much time is spent on searching for the Most Applied Case (MAC) and no formal or automated processes were used. An analysis of an informal process followed by legal researchers enabled the identification of different concepts that could be combined to create a prescriptive model to recommend the MAC. A critical analysis of the literature was conducted to obtain a better understanding of the legal domain and the techniques that can be applied to assist with problems faced in this domain, related to information retrieval and extraction. This resulted in the creation of an IE Model based only on theory. Questionnaires were sent to experts to obtain a further understanding of the legal domain, highlight problems faced, and identify which attributes of a legal case can be used to help recommend the MAC. During the Design and Development activity of the DSR methodology, a prescriptive MAC Model for recommending the MAC was created based on findings from the literature review and questionnaires. The MAC Model consists of processes concerning: Information retrieval (IR); Information extraction (IE); Information storage; and Query-independent ranking. Analysis of IR and IE helped to identify problems experienced when processing text. Furthermore, appropriate techniques and algorithms were identified that can process legal documents and extract specific facts. The extracted facts were then further processed to allow for storage and processing by query-independent ranking algorithms. The processes incorporated into the model were then used to create a proof-of-concept prototype called the IE Prototype. The IE Prototype implements two processes called the IE process and the Database process. The IE process analyses different sections of a legal case to extract specific facts. The Database process then ensures that the extracted facts are stored in a document database for future querying purposes. The IE Prototype was evaluated using the technical risk and efficacy strategy from the Framework for Evaluation of Design Science. Both formative and summative evaluations were conducted. Formative evaluations were conducted to identify functional issues of the prototype whilst summative evaluations made use of real-world legal cases to test the prototype. Multiple experiments were conducted on legal cases, known as source cases, that resulted in facts from the source cases being extracted. For the purpose of the experiments, the term “source case” was used to distinguish between a legal case in its entirety and a legal case’s list of cases referred to. Two types of NoSQL databases were investigated for implementation namely, a graph database and a document database. Setting up the graph database required little time. However, development issues prevented the graph database from being successfully implemented in the proof-of-concept prototype. A document database was successfully implemented as an alternative for the proof-of-concept prototype. Analysis of the source cases used to evaluate the IE Prototype revealed that 96% of the source cases were categorised as being partially extracted. The results also revealed that the IE Prototype was capable of processing large amounts of source cases at a given time.
- Full Text:
- Date Issued: 2019
Ray Charles: a psychobiographical study
- Authors: Biggs, Ilze
- Date: 2008
- Subjects: Charles, Ray, 1930-2004 Psychology -- Biographical methods -- Case studies Jazz singers -- Biography Blind entertainers -- Psychology
- Language: English
- Type: Thesis , Masters , MA
- Identifier: vital:2933 , http://hdl.handle.net/10962/d1002442
- Description: Psychobiography is the formulation of an individual's narrative according to a psychological theory. Psychobiographical researchers face a number of challenges. One pertinent challenge is the limited amount of psychobiographical research conducted at academic institutions, including South Africa. Although a number of studies had been completed in the past decade, the impact of psychobiographical research remains negligible. Although much has been written about Ray Charles, none of the existing literature adopted a specific psychological focus. Charles developed from a young boy in a poverty stricken, racially segregated society into an exceptionally successful musician who worked productively until he died at the age of 73. He was selected as the subject on the basis of interest value, uniqueness and significance of life achievements. The primary aim of this study was to explore and describe the development of Charles according to Levinson's (Levinson, et. ai, 1978) theoretical framework. Levinson's theory of adult development identifies and describes the important changes that occur throughout the lifespan of an individual. A secondary aim was to provide an understanding of Charles within the social, economic and historical context in which he lived. The data collection and analysis was conducted according to Yin's (2003) 'analytic generalization'. The data was analysed according to three linked sub-processes proposed by Huberman and Miles (1994).
- Full Text:
- Date Issued: 2008
- Authors: Biggs, Ilze
- Date: 2008
- Subjects: Charles, Ray, 1930-2004 Psychology -- Biographical methods -- Case studies Jazz singers -- Biography Blind entertainers -- Psychology
- Language: English
- Type: Thesis , Masters , MA
- Identifier: vital:2933 , http://hdl.handle.net/10962/d1002442
- Description: Psychobiography is the formulation of an individual's narrative according to a psychological theory. Psychobiographical researchers face a number of challenges. One pertinent challenge is the limited amount of psychobiographical research conducted at academic institutions, including South Africa. Although a number of studies had been completed in the past decade, the impact of psychobiographical research remains negligible. Although much has been written about Ray Charles, none of the existing literature adopted a specific psychological focus. Charles developed from a young boy in a poverty stricken, racially segregated society into an exceptionally successful musician who worked productively until he died at the age of 73. He was selected as the subject on the basis of interest value, uniqueness and significance of life achievements. The primary aim of this study was to explore and describe the development of Charles according to Levinson's (Levinson, et. ai, 1978) theoretical framework. Levinson's theory of adult development identifies and describes the important changes that occur throughout the lifespan of an individual. A secondary aim was to provide an understanding of Charles within the social, economic and historical context in which he lived. The data collection and analysis was conducted according to Yin's (2003) 'analytic generalization'. The data was analysed according to three linked sub-processes proposed by Huberman and Miles (1994).
- Full Text:
- Date Issued: 2008
- «
- ‹
- 1
- ›
- »