Template Function noLZSS::factorize_file_stream_multiple_dna_w_rc

Function Documentation

template<class Sink>
size_t noLZSS::factorize_file_stream_multiple_dna_w_rc(const std::string &path, Sink &&sink)

Factorizes DNA text from a file containing multiple sequences using the noLZSS algorithm, with reverse complement awareness.

This template function reads DNA text directly from a file and factorizes it without loading the entire file into memory, considering both forward and reverse complement matches. Avoiding a full in-memory copy of the input makes it more memory-efficient for large files.
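To make "reverse complement matches" concrete, here is a minimal sketch of how the reverse complement of a DNA string is formed. The helper names `complement` and `reverse_complement` are hypothetical and not part of noLZSS, and the sketch assumes upper-case A/C/G/T input only.

```cpp
#include <algorithm>
#include <stdexcept>
#include <string>

// Hypothetical helper: complement of a single upper-case nucleotide.
inline char complement(char base) {
    switch (base) {
        case 'A': return 'T';
        case 'C': return 'G';
        case 'G': return 'C';
        case 'T': return 'A';
        default: throw std::invalid_argument("unexpected nucleotide");
    }
}

// Hypothetical helper: walk the input backwards and complement each base.
inline std::string reverse_complement(const std::string& dna) {
    std::string out(dna.size(), '\0');
    std::transform(dna.rbegin(), dna.rend(), out.begin(), complement);
    return out;
}
```

For example, `reverse_complement("AACG")` yields `"CGTT"`. A factor that matches in the reverse complement direction refers to earlier text read in this transformed form rather than verbatim.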

The input file may contain multiple sequences. The function provides low-level access to the factorization process: each computed factor is passed to the sink callable.
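The sink pattern can be sketched as follows. This is an illustration only: `sketch::Factor` (with assumed fields `start`, `length`, `ref`) and `sketch::factorize_stub` are hypothetical stand-ins that mirror the shape of the documented signature. The stub ignores the file and emits two fixed factors, so the example compiles without the library.

```cpp
#include <cstddef>
#include <cstdint>
#include <string>
#include <vector>

namespace sketch {

// Assumed factor layout for illustration; the real noLZSS::Factor may differ.
struct Factor {
    std::uint64_t start;
    std::uint64_t length;
    std::uint64_t ref;
};

// Hypothetical stand-in with the same shape as the documented function:
// it takes a path and a sink, calls the sink once per factor, and
// returns the number of factors emitted. It does NOT read the file.
template <class Sink>
std::size_t factorize_stub(const std::string& path, Sink&& sink) {
    (void)path;  // unused in this stub
    const Factor fixed[] = {{0, 1, 0}, {1, 3, 0}};
    std::size_t count = 0;
    for (const Factor& f : fixed) {
        sink(f);
        ++count;
    }
    return count;
}

}  // namespace sketch

// A sink that collects every factor into a vector.
inline std::vector<sketch::Factor> collect_factors(const std::string& path) {
    std::vector<sketch::Factor> factors;
    sketch::factorize_stub(path, [&factors](const sketch::Factor& f) {
        factors.push_back(f);
    });
    return factors;
}
```

With the real library, the call to `sketch::factorize_stub` would presumably be replaced by `noLZSS::factorize_file_stream_multiple_dna_w_rc`. Because the sink is an arbitrary callable, it can also count, filter, or write factors out directly instead of storing them.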

See also

factorize_file_multiple_dna_w_rc() for the non-template version that returns a vector

Note

This function builds the suffix tree directly from the file.

Template Parameters:
  • Sink – Callable type that accepts Factor objects

Parameters:
  • path – Path to input file containing DNA text with multiple sequences

  • sink – Callable that receives each computed factor

Returns:

Number of factors emitted