Function noLZSS::count_factors_file_multiple_dna_w_rc
Defined in File factorizer.cpp
Function Documentation
size_t noLZSS::count_factors_file_multiple_dna_w_rc(const std::string &path)
Counts the number of noLZSS factors in a DNA file with reverse complement awareness for multiple sequences.
This function reads DNA text from a file and counts factors without storing them or loading the entire file into memory. It’s the most memory-efficient way to get factor counts for large DNA files with multiple sequences.
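For readers unfamiliar with the term, the reverse complement of a DNA string is obtained by reversing it and swapping A with T and C with G. "Reverse complement awareness" presumably means that a factor may also match the reverse complement of earlier text, although this page does not spell out the details. The sketch below only illustrates the string operation itself; `complement` and `reverse_complement` are hypothetical helper names, not part of noLZSS, and the assumption of upper-case A/C/G/T input is ours.

```cpp
#include <algorithm>
#include <stdexcept>
#include <string>

// Hypothetical helper (not part of noLZSS): complement of one upper-case
// nucleotide. Anything other than A, C, G or T is rejected.
char complement(char base) {
    switch (base) {
        case 'A': return 'T';
        case 'C': return 'G';
        case 'G': return 'C';
        case 'T': return 'A';
        default:  throw std::invalid_argument("not an A/C/G/T nucleotide");
    }
}

// Hypothetical helper (not part of noLZSS): reverse the sequence, then
// complement every base.
std::string reverse_complement(const std::string& seq) {
    std::string out(seq.rbegin(), seq.rend());
    std::transform(out.begin(), out.end(), out.begin(), complement);
    return out;
}
```

For example, `reverse_complement("AACG")` yields `"CGTT"`, so an occurrence of `CGTT` later in the text could be related to an earlier `AACG` on the opposite strand.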
See also
count_factors_multiple_dna_w_rc() for in-memory counting
See also
factorize_file_multiple_dna_w_rc() for getting the actual factors from a file
Note
This function builds the suffix tree directly from the file
- Parameters:
path – Path to the input file containing DNA text with multiple sequences
- Returns:
Number of factors in the factorization
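A minimal calling sketch follows. This page does not name the library header to include, so the snippet does not include one; instead it takes the counting function as a callable with the documented signature (`const std::string &` in, `size_t` out), which keeps it compilable on its own. `FactorCounter` and `report_factor_count` are hypothetical names introduced for illustration only.

```cpp
#include <cstddef>
#include <functional>
#include <iostream>
#include <string>

// Same shape as the documented signature:
//   size_t count_factors_file_multiple_dna_w_rc(const std::string &path)
using FactorCounter = std::function<std::size_t(const std::string&)>;

// Hypothetical helper (not part of noLZSS): runs the supplied counter on a
// file path, prints the result, and returns it to the caller.
std::size_t report_factor_count(const std::string& path,
                                const FactorCounter& count) {
    const std::size_t n = count(path);
    std::cout << path << ": " << n << " factors\n";
    return n;
}
```

In a real program, one would pass `noLZSS::count_factors_file_multiple_dna_w_rc` as the second argument after including the appropriate library header. How the function reports an unreadable file is not stated on this page, so error handling is left out of the sketch.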