Perform Alignment-free k-tuple frequency comparisons from sequences. This can be in the form of two input files (e.g. a reference and a query) or a single file for pairwise comparisons to be made.
Use RAM to store pairwise comparisons. (TBH, why would you want to do a PW comparison on something THAT big, and if it is that big and you do it on disk, then it would take forever despite using no RAM).
The main thing to do here is ensure we check the amount of RAM FIRST and then fail fast with a warning.
e.g. making it so that the correct options are available to user with the correct data. So now we've included protein data as an input (which I think is great) I do think we need to decide which options (e.g. reverse compliment, distance measures, kmer sizes etc) are available to the user. E.g. I could easily auto detect if we have nucleotide or protein data and then give a warning about the options for example.