- Title
- An Evaluation of Text Mining Techniques in Sampling of Network Ports from IBR Traffic
- Creator
- Chindipha, Stones D
- Creator
- Irwin, Barry V W
- Creator
- Herbert, Alan
- Date Issued
- 2019
- Date
- 2019
- Type
- text
- Type
- article
- Identifier
- http://hdl.handle.net/10962/427630
- Identifier
- vital:72452
- Identifier
- https://www.researchgate.net/profile/Stones-Chindi-pha/publication/335910179_An_Evaluation_of_Text_Mining_Techniques_in_Sampling_of_Network_Ports_from_IBR_Traffic/links/5d833084458515cbd1985a38/An-Evaluation-of-Text-Mining-Techniques-in-Sampling-of-Network-Ports-from-IBR-Traffic.pdf
- Description
- Information retrieval (IR) has had techniques that have been used to gauge the extent to which certain keywords can be retrieved from a document. These techniques have been used to measure similarities in duplicated images, native language identification, optimize algorithms, among others. With this notion, this study proposes the use of four of the Information Retrieval Techniques (IRT/IR) to gauge the implications of sampling a/24 IPv4 ports into smaller subnet equivalents. Using IR, this paper shows how the ports found in a/24 IPv4 net-block relate to those found in the smaller subnet equivalents. Using Internet Background Radiation (IBR) data that was collected from Rhodes University, the study found compelling evidence of the viability of using such techniques in sampling datasets. Essentially, being able to identify the variation that comes with sampling the baseline dataset. It shows how the various samples are similar to the baseline dataset. The correlation observed in the scores proves how viable these techniques are to quantifying variations in the sampling of IBR data. In this way, one can identify which subnet equivalent best represents the unique ports found in the baseline dataset (IPv4 net-block dataset).
- Format
- 5 pages
- Format
- Language
- English
- Relation
- Proceedings of Southern African Telecommunication Networks and Applications Conference (SATNAC)
- Relation
- Chindipha, S.D., Irwin, B. and Herbert, A., 2019. An Evaluation of Text Mining Techniques in Sampling of Network Ports from IBR Traffic. In The Changing Face of Telcos in a Digital World. Southern Africa Telecommunication Networks and Applications Conference (SATNAC)
- Relation
- Proceedings of Southern African Telecommunication Networks and Applications Conference (SATNAC) volume 2019 number 1 1 5 2019 Conference
- Rights
- Publisher
- Rights
- Use of this resource is governed by the terms and conditions of the Southern Africa Telecommunication Networks and Applications Conference (SA TNAC) Statement (https://www.satnac.org.za/)
- Hits: 103
- Visitors: 107
- Downloads: 6
Thumbnail | File | Description | Size | Format | |||
---|---|---|---|---|---|---|---|
View Details Download | SOURCE1 | An Evaluation of Text Mining Techniques in Sampling of Network Ports from IBR Traffic.pdf | 264 KB | Adobe Acrobat PDF | View Details Download |