.. _cli::p3-closest-seqs:


###############
p3-closest-seqs
###############

.. highlight:: perl


**********************************
Find Closest Sequences Using Kmers
**********************************


.. code-block:: perl

     p3-closest-seqs.pl [options] kmerDB


This script takes as input a file of DNA or protein sequences and compares them to a kmer database. The closest
groups within a certain threshold are listed. So, for example, given a kmer database of protein sequences for features
and an incoming file of new proteins, each new protein would be paired with the features in the kmer database with
which it is closest. Given a kmer database of DNA sequences for genomes and an incoming file of contigs, each incoming
contig would be paired with the genomes with which it is closest. Use :ref:`cli::p3-build-kmer-db` to build the kmer database.

Parameters
==========


The positional parameter is the file name of the kmer database.

The standard input can be overridden using the options in :ref:`cli-input-options`.

The default location for the incoming sequence is the last column of the input file. Use the options in :ref:`cli-column-options`
to change it.

The following additional options are supported.


min
 
 Minimum number of kmer matches for a sequence to be considered close. The default is \ ``40``\ .