SNParser

About

This is a small script can help to annotate your SNPs. The program take on input your SNP-ID from the VCF file and generate a report based on SNPedia (https://www.snpedia.com/) data.

Steps

1. File preparation

Firstly, you need to convert the file for reading. To do this, filter out the unique SNP-IDs by bash:

Note: If you want to select clinically relevant SNPs, you can use the Clinwar database - download a VCF with ClinVar variants (https://ftp.ncbi.nlm.nih.gov/pub/clinvar/vcf_GRCh38/clinvar.vcf.gz). You should also download SnpSift (https://pcingola.github.io/SnpEff/)

java -jar SnpSift.jar annotate clinvar.vcf snps.vcf > snps_snpsift_clinvar.vcf

# Filtering out the necessary columns in the original vcf file
egrep -v '^#|^$' snps_snpsift_clinvar.vcf | cut -f 1-6,10 > snp_python.txt

# Select the identifiers of SNPs
awk '($32!="-")' snps_snpsift_clinvar.txt | grep risk_factor | cut -f 1-3,19 | sort | uniq > sorted_SNP.txt
cut -f 3 sorted_SNP.txt | uniq > zzz.txt
cut -d ';' -f1 zzz.txt > SNP_identificators.txt

2. The main part

Use SNParser.py script and enjoy.

Note: The input script accepts two files: SNP_identificators.txt and you 'raw' VCF snp_python.txt in TXT format!

As a result you should to get html file, which will contain something like that:

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
Python_script		Python_script
README.md		README.md
leonov_v_d_home_project1_Vladislav_Leonov_chek.ipynb		leonov_v_d_home_project1_Vladislav_Leonov_chek.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SNParser

About

Steps

1. File preparation

2. The main part

About

Releases

Languages

FLinT3/SNParser

Folders and files

Latest commit

History

Repository files navigation

SNParser

About

Steps

1. File preparation

2. The main part

About

Topics

Resources

Stars

Watchers

Forks

Releases

Languages