Impacts of Frequent Itemset Hiding Algorithms on Privacy Preserving Data Mining

dc.contributor.advisor Ergenç, Belgin
dc.contributor.author Yıldız, Barış
dc.contributor.other 03.04. Department of Computer Engineering
dc.contributor.other 03. Faculty of Engineering
dc.contributor.other 01. Izmir Institute of Technology
dc.date.accessioned 2014-07-22T13:50:45Z
dc.date.available 2014-07-22T13:50:45Z
dc.date.issued 2010
dc.description Thesis (Master)--Izmir Institute of Technology, Computer Engineering, Izmir, 2010 en_US
dc.description Includes bibliographical references (leaves: 54-58) en_US
dc.description Text in English; Abstract: Turkish and English en_US
dc.description x, 69 leaves en_US
dc.description.abstract The invincible growing of computer capabilities and collection of large amounts of data in recent years, make data mining a popular analysis tool. Association rules (frequent itemsets), classification and clustering are main methods used in data mining research. The first part of this thesis is implementation and comparison of two frequent itemset mining algorithms that work without candidate itemset generation: Matrix Apriori and FP-Growth. Comparison of these algorithms revealed that Matrix Apriori has higher performance with its faster data structure. One of the great challenges of data mining is finding hidden patterns without violating data owners. privacy. Privacy preserving data mining came into prominence as a solution. In the second study of the thesis, Matrix Apriori algorithm is modified and a frequent itemset hiding framework is developed. Four frequent itemset hiding algorithms are proposed such that: i) all versions work without pre-mining so privacy breech caused by the knowledge obtained by finding frequent itemsets is prevented in advance, ii) efficiency is increased since no pre-mining is required, iii) supports are found during hiding process and at the end sanitized dataset and frequent itemsets of this dataset are given as outputs so no post-mining is required, iv) the heuristics use pattern lengths rather than transaction lengths eliminating the possibility of distorting more valuable data. en_US
dc.identifier.uri https://hdl.handle.net/11147/3037
dc.language.iso en en_US
dc.publisher Izmir Institute of Technology en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject.lcsh Data minning en
dc.title Impacts of Frequent Itemset Hiding Algorithms on Privacy Preserving Data Mining en_US
dc.type Master Thesis en_US
dspace.entity.type Publication
gdc.author.institutional Yıldız, Barış
gdc.author.institutional Ergenç Bostanoğlu, Belgin
gdc.coar.access open access
gdc.coar.type text::thesis::master thesis
gdc.description.department Thesis (Master)--İzmir Institute of Technology, Computer Engineering en_US
gdc.description.publicationcategory Tez en_US
gdc.description.scopusquality N/A
gdc.description.wosquality N/A
relation.isAuthorOfPublication 3b51d444-157d-4dff-a209-e28543a80dcd
relation.isAuthorOfPublication.latestForDiscovery 3b51d444-157d-4dff-a209-e28543a80dcd
relation.isOrgUnitOfPublication 9af2b05f-28ac-4014-8abe-a4dfe192da5e
relation.isOrgUnitOfPublication 9af2b05f-28ac-4004-8abe-a4dfe192da5e
relation.isOrgUnitOfPublication 9af2b05f-28ac-4003-8abe-a4dfe192da5e
relation.isOrgUnitOfPublication.latestForDiscovery 9af2b05f-28ac-4014-8abe-a4dfe192da5e

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
T000856.pdf
Size:
1.04 MB
Format:
Adobe Portable Document Format
Description:
MasterThesis

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: