Function noLZSS::write_factors_binary_file_multiple_dna_w_rc
Defined in File factorizer.cpp
Function Documentation
size_t noLZSS::write_factors_binary_file_multiple_dna_w_rc(const std::string &in_path, const std::string &out_path)
Factorizes a DNA file containing multiple sequences, with reverse complement awareness, and writes the resulting noLZSS factors to a binary output file.
This function reads DNA text from an input file, performs factorization with reverse complement for multiple sequences, and writes the resulting factors in binary format to an output file. Each factor is written as three uint64_t values (start position, length, ref).
Note
Binary format: each factor is 24 bytes (3 × uint64_t: start, length, ref)
Note
This function overwrites the output file if it exists
Warning
Ensure sufficient disk space for the output file, which takes 24 bytes per factor
Note
Reverse complement factors have RC_MASK set in the ref field
- Parameters:
in_path – Path to input file containing DNA text with multiple sequences
out_path – Path to output file where binary factors will be written
- Returns:
Number of factors written to the output file
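Example
The record layout described in the notes above (three uint64_t values per factor, 24 bytes in total) can be read back with a few lines of C++. The following is a minimal sketch, not part of the noLZSS API. It rests on several assumptions: the file contains only consecutive factor records with no header or footer, the values are stored in the byte order of the machine that reads them, and RC_MASK is the highest bit of the ref field. The names Factor, kAssumedRcMask, read_factors, is_reverse_complement, and ref_position are hypothetical and introduced only for illustration. Check the actual RC_MASK definition in the library before relying on the value used here.

```cpp
#include <cstdint>
#include <fstream>
#include <string>
#include <vector>

// ASSUMPTION: the reverse complement flag is the top bit of `ref`.
// The real RC_MASK value is defined by noLZSS and may differ.
constexpr std::uint64_t kAssumedRcMask = 1ULL << 63;

// Hypothetical in-memory view of one 24-byte record.
struct Factor {
    std::uint64_t start;
    std::uint64_t length;
    std::uint64_t ref;
};

// Reads consecutive (start, length, ref) triples until the file ends.
// ASSUMPTION: no header or footer, host byte order.
// A trailing partial record (fewer than 24 bytes) is silently ignored.
std::vector<Factor> read_factors(const std::string& path) {
    std::ifstream in(path, std::ios::binary);
    std::vector<Factor> factors;
    std::uint64_t fields[3];
    while (in.read(reinterpret_cast<char*>(fields), sizeof(fields))) {
        factors.push_back(Factor{fields[0], fields[1], fields[2]});
    }
    return factors;
}

// True if the assumed reverse complement bit is set in `ref`.
bool is_reverse_complement(const Factor& f) {
    return (f.ref & kAssumedRcMask) != 0;
}

// The `ref` value with the assumed reverse complement bit cleared.
std::uint64_t ref_position(const Factor& f) {
    return f.ref & ~kAssumedRcMask;
}
```

Under these assumptions, the size of the vector returned by read_factors for a file produced by this function should equal the factor count that the function returned.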