10.17632/F5Y9CGGNXY.2
Fernandes, Marcelo A. C.
Marcelo A. C.
Fernandes
k-mers 1D and 2D representation dataset of SARS-CoV-2 nucleotide sequences
Mendeley
2020
Dataset
Bioinformatics
FOS: Computer and information sciences
Digital Signal Processing
Computational Bioinformatics
Barbosa, Raquel De M.
Raquel De M.
Barbosa
2020-05-26
10.17632/f5y9cggnxy
Creative Commons Attribution 4.0 International
The dataset provides five types of k-mers genome representation characterized as k-mers count 1D, k-mers probability 1D, k-mers count 2D, k-mers probability 2D, and k-mers image. The dataset is composed of 1557 virus instances of SARS-CoV-2. Besides, the dataset also provides a data stream of 11540 viruses from the Virus-Host DB dataset and the other three Riboviria viruses from NCBI (Betacoronavirus RaTG13, bat-SL-CoVZC45, and bat-SL-CoVZXC21).