KATKA: A KRAKEN-like tool with k given at query time

Int Symp String Process Inf Retr. 2022 Nov:13617:191-197. doi: 10.1007/978-3-031-20643-6_14. Epub 2022 Nov 1.

Abstract

We describe a new tool, KATKA, that stores a phylogenetic tree T such that later, given a pattern P [ 1 . . m ] and an integer k , it can quickly return the root of the smallest subtree of T containing all the genomes in which the k-mer P [ i . . i + k - 1 ] occurs, for 1 i m - k + 1 . This is similar to KRAKEN's functionality but with k given at query time instead of at construction time.