Quasi-supervised strategies for compound-protein interaction prediction [Master Thesis]

Çakı, Onur

Please use this identifier to cite or link to this item: https://hdl.handle.net/11147/11684

Title:	Quasi-supervised strategies for compound-protein interaction prediction [Master Thesis]
Other Titles:	Bileşik-protein etkileşimi tahmini için yarı-güdümlü yaklaşımlar [Master Thesis]
Authors:	Çakı, Onur
Advisors:	Karaçalı, Bilge
Keywords:	Bioinformatics Cheminformatics Machine learning Quasi-supervised learning
Publisher:	01. Izmir Institute of Technology
Source:	Çakı, O. (2021). Quasi-supervised strategies for compound-protein interaction prediction. Unpublished master's thesis, İzmir Institute of Technology, İzmir, Turkey
Abstract:	In-silico prediction of compound-protein interaction using computational methods preserves its importance in various pharmacology applications because the wet-lab experiments are time-consuming, laborious and costly. Most machine learning methods proposed to that end approach this problem with supervised learning strategies in which known interactions are labeled as positive and the rest are labeled as negative. However, treating all unknown interactions as negative instances may lead to inaccuracies in real practice since some of the unknown interactions are bound to be positive interactions waiting to be identified as such. In this study, we propose to address this problem using the Quasi-Supervised Learning algorithm. In this framework, potential interactions are predicted by estimating the overlap between two datasets: a true positive dataset which consists of compound-protein pairs with known interactions and an unknown dataset which consists of all the remaining compound-protein pairs. The potential interactions are then identified as those in the unknown dataset that overlap with the interacting pairs in the true positive dataset in terms of the associated similarity structure between interacting pairs. Experimental results on GPCR and Nuclear Receptor datasets show that the proposed method can identify actual interactions from all possible combinations. Laboratuvar ortamında gerçekleştirilen bileşik-protein etkileşimi belirleme deneylerinin zaman alıcı, zahmetli ve maliyetli olması nedeniyle, hesaplamalı yöntemler kullanarak dijital ortamda bileşik-protein etkileşimi tahmini önemini korumaktadır. Bu amaçla geliştirilen pek çok yapay öğrenme yöntemi bu probleme bilinen etkileşimlerin pozitif, eldeki geri kalan bütün etkileşimlerin ise negatif olarak etiketlendiği güdümlü öğrenme stratejileri ile yaklaşmıştır. Fakat bilinmeyen etkileşimler açığa çıkarılmayı bekleyen pozitif etkileşimleri de barındıracağından, bilinmeyen bütün etkileşimleri negatif örnek olarak ele almak gerçek uygulamalarda hatalı sonuçlara yol açacaktır. Bu çalışmada, bu problemin Yarı-Güdümlü Öğrenme Algoritması ile çözülmesi amaçlanmaktadır. Bu çerçevede olası etkileşimler iki veri kümesinin örtüşümü kestirilerek tahmin edilir: Etkileştikleri bilinen bileşik-protein çiftlerinden oluşan gerçek pozitif veri kümesi ve geri kalan diğer bütün bileşik-protein çiftlerinden oluşan bilinmeyen veri kümesi. Gerçek pozitif veri kümesindeki etkileşen çiftlerle ilgili yapısal benzerlik açısından örtüşen bilinmeyen veri kümesindeki bileşik-protein çiftleri potansiyel etkileşimler olarak tanımlanır. GPCR ve Nuclear Receptor veri kümeleri üzerindeki deneysel sonuçlar, amaçlanan yöntemin bütün olası çiftlerden gerçek etkileşimleri saptayabildiğini göstermektedir.
Description:	Thesis (Master)--Izmir Institute of Technology, Electronics and Communication Engineering, Izmir, 2021 Includes bibliographical references (leaves: 54-59) Text in English; Abstract: Turkish and English
URI:	https://hdl.handle.net/11147/11684
Appears in Collections:	Master Degree / Yüksek Lisans Tezleri

Files in This Item:

File	Description	Size	Format
10410010.pdf	Master Thesis	2 MB	Adobe PDF	View/Open

Show full item record

CORE Recommender

Page view(s)

226

checked on Jul 22, 2024

Download(s)

466

checked on Jul 22, 2024

Google Scholar^TM

Check

Files in This Item:

Page view(s)

Download(s)

Google ScholarTM

Google Scholar^TM