Auditing Keyword Queries Over Text Documents

Appareddy, Bharath Kumar Reddy (2017) Auditing Keyword Queries Over Text Documents. Masters thesis, Indian Institute of Technology Hyderabad.

[img] Text
CS15MTECH11001.pdf - Submitted Version
Restricted to Registered users only until 28 June 2020.

Download (769kB) | Request a copy

Abstract

Designing a robust data management system, requires securing access to all the three types of data, namely structured, semi-structured and unstructured data. In this paper, we present an auditing model to secure text document access. Given a sensitive docu- ment, we compute the candidate suspicious keyword queries that may have accessed the sensitive document. Our auditing model allows users to specify either the full document or some specific portion of the document as sensitive. All queries that have accessed a sensitive document may not lead to disclosure of the sensitive document, some of them might be very regular accesses. We present an outlier mining based algorithm to find top-k anomalous queries from candidate suspicious queries. Data security and privacy is an issue of growing importance in healthcare domain. Query auditing is often used in healthcare domain to detect privacy violation. However, as unstructured healthcare data, such as medical reports, etc., are not easily available for public research. In this paper, we show how one can use the publically available DBLP data to create an equivalent healthcare data, which can be used for experimental evaluation.

[error in script]
IITH Creators:
IITH CreatorsORCiD
Item Type: Thesis (Masters)
Uncontrolled Keywords: auditing modle, data mining, TD839
Subjects: Computer science > Computer programming, programs, data
Divisions: Department of Computer Science & Engineering
Depositing User: Team Library
Date Deposited: 29 Jun 2017 07:17
Last Modified: 29 Jun 2017 07:17
URI: http://raiith.iith.ac.in/id/eprint/3306
Publisher URL:
Related URLs:

Actions (login required)

View Item View Item
Statistics for RAIITH ePrint 3306 Statistics for this ePrint Item