Show simple item record

dc.contributor.authorOmanović, Amra
dc.contributor.authorKazan, Hilal
dc.contributor.authorOblak, Polona
dc.contributor.authorCurk, Tomaž
dc.date.accessioned2022-04-26T06:38:15Z
dc.date.available2022-04-26T06:38:15Z
dc.date.issued2021
dc.identifier.citationOmanović, A., Kazan, H., Oblak, P. & Curk, T. (2021). Sparse data embedding and prediction by tropical matrix factorization. BMC Bioinformatics, 22(89), 1-18.en_US
dc.identifier.issn1471-2105
dc.identifier.urihttp://hdl.handle.net/20.500.12566/1178
dc.description.abstractBackground Matrix factorization methods are linear models, with limited capability to model complex relations. In our work, we use tropical semiring to introduce non-linearity into matrix factorization models. We propose a method called Sparse Tropical Matrix Factorization (STMF) for the estimation of missing (unknown) values in sparse data. Results We evaluate the efficiency of the STMF method on both synthetic data and biological data in the form of gene expression measurements downloaded from The Cancer Genome Atlas (TCGA) database. Tests on unique synthetic data showed that STMF approximation achieves a higher correlation than non-negative matrix factorization (NMF), which is unable to recover patterns effectively. On real data, STMF outperforms NMF on six out of nine gene expression datasets. While NMF assumes normal distribution and tends toward the mean value, STMF can better fit to extreme values and distributions. Conclusion STMF is the first work that uses tropical semiring on sparse data. We show that in certain cases semirings are useful because they consider the structure, which is different and simpler to understand than it is with standard linear algebra.en_US
dc.description.sponsorshipThis work is supported by the Slovene Research Agency, Young Researcher Grant (52096) awarded to AO, and research core funding (P1-0222 to PO and P2-0209 to TC).en_US
dc.language.isoengen_US
dc.publisherBMC Bioinformaticsen_US
dc.rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectData embeddingen_US
dc.subjectVeri gömmetr_TR
dc.subjectMatrix factorizationen_US
dc.subjectMatris çarpanlarına ayırmatr_TR
dc.subjectTropical factorizationen_US
dc.subjectTropical semiringen_US
dc.subjectSparse dataen_US
dc.subjectSeyrek veritr_TR
dc.subjectMatrix completionen_US
dc.subjectMatris tamamlamatr_TR
dc.titleSparse data embedding and prediction by tropical matrix factorizationen_US
dc.typeinfo:eu-repo/semantics/articleen_US
dc.relation.publicationcategoryInternational publicationen_US
dc.identifier.wosWOS:000624528900003
dc.identifier.scopus2-s2.0-85101785747
dc.identifier.volume22
dc.identifier.issue89
dc.identifier.startpage1
dc.identifier.endpage18
dc.contributor.orcid0000-0003-2461-4579 [Kazan, Hilal]
dc.contributor.abuauthorKazan, Hilal
dc.contributor.yokid107780 [Kazan, Hilal]
dc.contributor.ScopusAuthorID35094213400 [Kazan, Hilal]
dc.identifier.PubMedID33632116
dc.identifier.doi10.1186/s12859-021-04023-9


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record