The abundance (number of occurrences) of a read in a read set is an indicator of read confidence in high-throughput sequencing studies.
Write pseudocode, Python code, and C++ code for the read abundance problem. Make two submissions, including the pseudocode as a comment to both the Python and the C++ code.
The input is a collection of strings (genomic sequence reads, possibly reverse complemented) over the alphabet {A, C, G, T}.
The output is the sorted frequency distribution of the reads, where a read and its reverse complement count as the same read.
Input
TCATC TTGAT TCATC TGAAA GATGA TTTCA ATCAA TTGAT TTTCA
Output
ATCAA 3
GATGA 3
TGAAA 3
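One possible way to approach the problem is sketched below in Python. It is a minimal sketch, not a reference solution, and it rests on two assumptions inferred from the example rather than stated in the task: each read is represented by the lexicographically smaller of itself and its reverse complement, and the result is sorted by that representative string. Every count in the example is 3, so the example alone does not say whether sorting should be by read or by frequency. The function names `reverse_complement`, `canonical`, and `read_abundance` are illustrative only.

```python
from collections import Counter

# Assumed alphabet: A, C, G, T, with complement pairs A<->T and C<->G.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(read: str) -> str:
    """Complement every base, then reverse the string."""
    return read.translate(_COMPLEMENT)[::-1]


def canonical(read: str) -> str:
    """Assumed convention: pick the lexicographically smaller of the
    read and its reverse complement as the representative."""
    return min(read, reverse_complement(read))


def read_abundance(reads):
    """Count reads by canonical form and return (read, count) pairs
    sorted by read (assumed sort key)."""
    counts = Counter(canonical(r) for r in reads)
    return sorted(counts.items())


if __name__ == "__main__":
    reads = "TCATC TTGAT TCATC TGAAA GATGA TTTCA ATCAA TTGAT TTTCA".split()
    for read, count in read_abundance(reads):
        print(read, count)
```

On the sample input, TCATC and GATGA are reverse complements of each other, as are TTGAT and ATCAA, and TTTCA and TGAAA, so the nine reads collapse into three groups of three. If the intended order is by frequency instead, only the `sorted` call would need a different key.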