Document Type


Publication Date



During the summer of 2020, I worked as a student researcher in bioinformatics with the Summer Science Research Institute (SSRI) at Connecticut College. The main goal of the research was to create a Python program that assembles large genomes by combining two types of DNA sequencing technologies: one that generates high-quality but short sequences, and one that generates low-quality but long sequences. My individual responsibility was to examine the Phred quality scores that the sequencing machine assigns to each DNA base in each sequence used to assemble the genomes. Specifically, I wrote a Python program that converts these quality scores to values between 0 and 1.
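As an illustration of this kind of conversion, the sketch below maps Phred scores to values between 0 and 1 using the standard Phred definition, Q = -10 * log10(P), where P is the estimated probability that a base call is wrong. It is not the program written for this project: the function names are hypothetical, and the Phred+33 ASCII encoding used in most FASTQ files is an assumption about the input format.

```python
# Minimal sketch, assuming the standard Phred definition and Phred+33 encoding.
# All function names are hypothetical and not taken from the original project.

def phred_to_error_probability(q: int) -> float:
    """Convert a Phred score Q to an error probability P = 10 ** (-Q / 10)."""
    if q < 0:
        raise ValueError("Phred scores cannot be negative")
    return 10 ** (-q / 10)


def decode_quality_string(quality: str, offset: int = 33) -> list[int]:
    """Decode an ASCII-encoded quality string into integer Phred scores.

    The default offset of 33 (Phred+33) is an assumption about the input.
    """
    return [ord(ch) - offset for ch in quality]


def quality_string_to_probabilities(quality: str, offset: int = 33) -> list[float]:
    """Map each character of a quality string to a value between 0 and 1."""
    return [phred_to_error_probability(q)
            for q in decode_quality_string(quality, offset)]


if __name__ == "__main__":
    # '!' encodes Q0, '+' encodes Q10, '5' encodes Q20, 'I' encodes Q40.
    for ch, p in zip("!+5I", quality_string_to_probabilities("!+5I")):
        print(ch, p)
```

Under this definition a higher Phred score gives a smaller value, so a score of 40 corresponds to an estimated error rate of 1 in 10,000. If a value that grows with quality is preferred, 1 - P (the estimated probability that the base call is correct) is a common alternative.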


The views expressed in this paper are solely those of the author.