Show simple item record

dc.contributor.advisorMitra, Debasis
dc.contributor.authorKolar, Michael Robert
dc.date.accessioned2023-01-03T13:47:51Z
dc.date.available2023-01-03T13:47:51Z
dc.date.created2022-12
dc.date.issued2022-12
dc.date.submittedDecember 2022
dc.identifier.urihttp://hdl.handle.net/11141/3614
dc.descriptionThesis (M.S.) - Florida Institute of Technology, 2022.en_US
dc.description.abstractOne of the difficulties of genetic research is the asymmetrical relationship between data collection techniques and data analysis techniques. The goal of this research was to test a novel application of non-negative matrix factorization, which would allow researchers to more easily identify co-mutations. Those co-mutations then can then be further verified by frequency analysis. This pruning process allows researchers to identify more fruitful research opportunities, saving time, energy, and funding. Past research has utilized non-negative matrix factorization to extract factors which meaningfully express underlying data features. This study extends the depth of non-negative matrix factorization knowledge in various ways. First, a novel cost function was utilized to convert raw genetic data into numerical values appropriate for matrix operations. Second, this research utilized the alternating non-negative least squares matrix factorization variant for its faster convergence time compared to the more traditional multiplicative update approach. Third, traditionally data sets were not factored at multiple factor counts, but this study extends previously established methods by performing an analysis over multiple factor counts. Fourth, this study suggests evidence that factors produced by non-negative matrix factorization contain co-mutations, which were verified by a statistical analysis. Fifth, this study demonstrated that non-negative matrix factorization has an unsupervised ability to partition a data set into chronologically separated clusters. This research indicates that non-negative matrix factorization is a scalable algorithm for identifying genetic co-mutations within a practical computational time frame.en_US
dc.format.mimetypeapplication/pdf
dc.language.isoen_USen_US
dc.rightsCopyright held by authoren_US
dc.titleNon-negative Matrix Factorization in the Identification of Co-mutationsen_US
dc.typeThesisen_US
dc.date.updated2023-01-03T13:27:11Z
thesis.degree.nameMaster of Science in Computer Scienceen_US
thesis.degree.levelMastersen_US
thesis.degree.disciplineComputer Scienceen_US
thesis.degree.departmentComputer Engineering and Sciencesen_US
thesis.degree.grantorFlorida Institute of Technologyen_US
dc.type.materialtext


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record