Function noLZSS::count_factors_file_multiple_dna_w_rc

Function Documentation

size_t noLZSS::count_factors_file_multiple_dna_w_rc(const std::string &path)

Counts noLZSS factors in a DNA file with reverse complement awareness for multiple sequences.

Counts the number of noLZSS factors in a DNA file with reverse complement awareness for multiple sequences.

This function reads DNA text from a file and counts factors without storing them or loading the entire file into memory. It’s the most memory-efficient way to get factor counts for large DNA files with multiple sequences.

Reads DNA text from a file and counts noLZSS factors without storing them. This is the most memory-efficient way to get factor counts for large genomic files with multiple sequences.

See also

count_factors_multiple_dna_w_rc() for in-memory counting

See also

factorize_file_multiple_dna_w_rc() for getting the actual factors from a file

See also

count_factors_multiple_dna_w_rc() for in-memory counting

Note

This function builds the suffix tree directly from the file

Parameters:
  • path – Path to input file containing DNA text with multiple sequences

  • path – Path to the input file containing DNA text with multiple sequences

Returns:

Number of factors in the factorization

Returns:

Number of factors in the factorization