"The index file is older than the data file" when opening 1000 Genomes VCF #877
Comments
Pysam 0.15.4: the bug is still present.
That warning message is from htslib, and has been in pysam's bundled htslib since 2014. It has not changed. Presumably when you saw no message from previous versions of pysam, your previous copies of these files had different timestamps. What are the timestamps on the /path_to/ALL.chr6_GRCh38.genotypes.20170504.vcf.gz* files on your machine? I expect the warning is accurate. You should The timestamps for these files on ftp.1000genomes.ebi.ac.uk make it appear that the .tbi index may be out of date:
so if your downloaded files reflect these timestamps (e.g., you used |
Thanks for the explanation.
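For anyone who wants to check the timestamps discussed above on their own machine, here is a minimal sketch using only the Python standard library. It assumes the warning comes from a plain comparison of modification times and that the index sits next to the data file with a `.tbi` suffix, as in this issue. The helper names `index_is_stale` and `match_index_mtime` are hypothetical and are not part of pysam or htslib.

```python
import os
from pathlib import Path


def _default_index(data_path):
    # Assumption: the tabix index is the data file name plus ".tbi".
    return Path(str(data_path) + ".tbi")


def index_is_stale(data_path, index_path=None):
    """Return True if the index's mtime is older than the data file's mtime."""
    data_path = Path(data_path)
    index_path = Path(index_path) if index_path else _default_index(data_path)
    return index_path.stat().st_mtime_ns < data_path.stat().st_mtime_ns


def match_index_mtime(data_path, index_path=None):
    """Set the index's mtime equal to the data file's mtime.

    This only changes file metadata. It is reasonable only when you trust
    that the index really was built from this exact data file.
    """
    data_path = Path(data_path)
    index_path = Path(index_path) if index_path else _default_index(data_path)
    index_stat = index_path.stat()
    data_stat = data_path.stat()
    # Keep the index's access time, copy the data file's modification time.
    os.utime(index_path, ns=(index_stat.st_atime_ns, data_stat.st_mtime_ns))
```

Calling `index_is_stale('/path_to/ALL.chr6_GRCh38.genotypes.20170504.vcf.gz')` should show whether the downloaded files carry the timestamps in question. Aligning the mtimes merely hides the warning, so if there is any doubt that the index matches the data, rebuilding the index is the safer choice.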
- Last time I implemented the filtering of repeated insertions of differing sizes incorrectly. The new algorithm is definitely correct.
- The CHROM-POS-ID table of the conversion DB is no longer populated in chunks. Not enough data goes into it to raise concerns about excessive RAM usage.
- Temporarily worked around the problem of indexing the chrX file with newer versions of Pysam (samtools/bcftools#1154). For chrX, a ready-made index is now downloaded from the 1000 Genomes FTP. Why didn't I do this for all 1000G archives? Because htslib would then spam you with numerous warnings (see pysam-developers/pysam#877).
- Made the code that checks whether particular files exist more elegant.
elementary OS 5.1
pysam 0.15.3
Download any 1000 Genomes VCF:
ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/supporting/GRCh38_positions/ALL.chr6_GRCh38.genotypes.20170504.vcf.gz
Download corresponding index:
ftp://ftp.1000genomes.ebi.ac.uk/vol1/ftp/release/20130502/supporting/GRCh38_positions/ALL.chr6_GRCh38.genotypes.20170504.vcf.gz.tbi
Try to create a VariantFile object:
from pysam import VariantFile
variant_file_obj = VariantFile('/path_to/ALL.chr6_GRCh38.genotypes.20170504.vcf.gz')
[W::hts_idx_load2] The index file is older than the data file: /path_to/ALL.chr6_GRCh38.genotypes.20170504.vcf.gz.tbi
[W::hts_idx_load2] The index file is older than the data file: /path_to/ALL.chr6_GRCh38.genotypes.20170504.vcf.gz.tbi
There was no error in the previous version of pysam.