Data Structures & Algorithms
Q1. (5 pts): Write the line of code that would install the package ggplot2 in R.
Q2. (10 pts): Compare and contrast VCF and GFF files.
Q3. (10 pts): Compare & Contrast FASTA and FASTQ files.
Q4. (5 pts): What are comment lines in an algorithm? When should they be used?
Q5. (15 pts): What are SAM, BAM, and BAI files? What are the differences between them files? How are they generated?
Q6. (20 pts): For a given algorithm, how will BigO change if the algorithm is run on different hardware (e.g. a faster processor)? How will it change if a larger data set is run through the algorithm? Explain.
Q7. (25 pts): Using pseudocode, describe an algorithm that sums the values between 1 and N for a given value of N. What do you think the BigO will be for this algorithm and why?
Q8. (10 pts): Many bioinformatics data file formats utilize plain text as a storage format.
- List 5 bioinformatics data formats that are in fact simply structured plain text
- List 3 different editors that would work well for editing Plain Text (Hint: MS Word is NOT good for editing plain text!).