Top detective
糖心原创 doctoral student Soon Jye Kho named top performer in FDA challenge to detect medical mislabeling
March 22, 2019
March 22, 2019
糖心原创 student Soon Jye Kho was named one of the top performers in an international research challenge by the U.S. Food and Drug Administration designed to detect medical mislabeling through computer analysis.
Kho鈥檚 entry was among the three top-performing submissions out of 82 submitted by academic and private researchers from around the world in the .
Researchers taking part in the challenge hailed from Denmark, Korea, Luxembourg as well as the Cleveland Clinic and U.S. universities, including Texas Tech and the University of Michigan.
鈥淚 feel excited about it and kind of proud,鈥 said Kho, whose research area is using computer science to analyze medical data. 鈥淚鈥檝e been able to utilize what I鈥檝e learned in a practical real-life scenario.鈥
Kho, who grew up in Malaysia, is a Ph.D. student at . His adviser is Amit Sheth, the LexisNexis Ohio Eminent Scholar, a professor of , and the executive director of Kno.e.sis.
鈥淜ho鈥檚 win is significant in two ways,鈥 said Sheth. 鈥淔irst, it represents one more win for a Kno.e.sis student in a national or an international level competition. Second, it is an important addition to our growing body of research in precision medicine and personalized digital health, topics of significant importance for Kno.e.sis鈥 role as an Ohio Center of Excellence in BioHealth Innovation.鈥
The online data challenge occurred from November to December. Researchers participated remotely. Top performers were announced in February.
FDA challenges are designed to find solutions to real-world problems. The objective of the mislabeling challenge was to encourage development and evaluation of computational algorithms that can accurately detect and correct mislabeled samples.
The accidental swapping of patient tissue samples or genetic data can contribute to invalid conclusions and wrong or ineffective treatments.
鈥淲hen a sample belongs to a healthy individual but is mislabeled as cancer tissue, the physician might prescribe unnecessary treatment to the individual, which could harm them,鈥 said Kho.
Participants in the challenge were presented with 160 tumor samples, with about 15 percent of them containing labeling errors. They were asked to create computational algorithms to model the relationship between clinical attributes, protein profiles and mRNA profiles using the data, then to apply the model to identify and correct mislabeled samples.
鈥淲e employed machine-learning techniques where we train the machine to see the pattern in a cancerous genetic profile and the normal genetic profile,鈥 said Kho. 鈥淪o if there is mismatch between machine prediction and a patient鈥檚 diagnosis, mislabeling is suspected and we could go back to reaffirm the labeling before performing any downstream analysis.鈥
Computer analysis is becoming a powerful approach to understanding disease and speeding the translation of new discoveries in the labs to patient care.
Kho believes the outcome of the challenge can have a real impact. After earning his Ph.D., he would like to conduct research in academia or the private sector.
鈥淚鈥檓 interested in precision medicines and translational research that speeds up findings in the lab to the clinical side,鈥 he said.