Tips on integration
1. QIIME 2 Integration with EUKARYOME
1.1. Importing EUKARYOME into QIIME 2
Initially, you should change the format to the “.qza” format and import the files into QIIME 2:
Importing the taxonomy file:
qiime tools import --type 'FeatureData[Taxonomy]' --input-path QIIME2_EUK_SSU_v1.8.tsv --output-path eukaryome_tax.qza
Importing the sequence file:
qiime tools import --input-path QIIME2_EUK_SSU_v1.8.fasta --output-path eukaryome_seq.qza --type 'FeatureData[Sequence]'
1.2. Taxonomic analysis
With the database imported, you can now perform taxonomic analysis. The taxonomic classification can be performed with different plugins; see the basic scripts below:
Feature Classifier Plugin:
qiime feature-classifier fit-classifier-naive-bayes --i-reference-reads eukaryome_seq.qza --i-reference-taxonomy eukaryome_tax.qza --o-classifier classifier.qza
Then, you can assign taxonomy to your sequences using the trained classifier:
qiime feature-classifier classify-sklearn --i-classifier classifier.qza --i-reads your_sequences.qza --o-classification taxonomy.qza
Blast+ plugin:
qiime feature-classifier classify-consensus-blast --i-query your_sequences.qza --i-reference-reads eukaryome_seq.qza --i-reference-taxonomy eukaryome_tax.qza --p-perc-identity 0.97 --o-classification taxonomy.qza
Please note that these are basic scripts and may need to be adjusted based on your specific needs and the parameters of your data.
2. mothur Integration with EUKARYOME
Here are the following basic scripts to use with different methods in mothur:
# Wang method (default)
mothur "#classify.seqs(fasta=your_sequences.fasta, reference=mothur_SSU_v1.8.fasta, taxonomy=mothur_SSU_v1.8.tax, method=wang, cutoff=80)"
# k-nearest neighbor consensus
mothur "#classify.seqs(fasta=your_sequences.fasta, reference=mothur_SSU_v1.8.fasta, taxonomy=mothur_SSU_v1.8.tax, method=knn, cutoff=80)"
# Zap
mothur "#classify.seqs(fasta=your_sequences.fasta, reference=mothur_SSU_v1.8.fasta, taxonomy=mothur_SSU_v1.8.tax, method=zap, cutoff=80)"
Please note that these are basic scripts and may need to be adjusted based on your specific needs and the parameters of your data.
3. SINTAX Integration with EUKARYOME
To perform taxonomic analysis using SINTAX and the EUKARYOME database, use the following commands:
#converting to udb format:
vsearch --makeudb_usearch SINTAX_SSU_v1.8.fasta --output SINTAX_SSU.udb
#assign taxonomy:
vsearch --sintax your_sequences.fasta -db SINTAX_SSU.udb -tabbedout sintax_results.txt -sintax_cutoff 0.8
Please note that these are basic scripts and may need to be adjusted based on your specific needs and the parameters of your data.
4. BLAST
First, you must format your fasta file into a BLAST database using the makeblastdb command. This command will create a nucleotide BLAST database from your fasta file:
makeblastdb -in General_SSU_v1.8.fasta -dbtype nucl -out EUK_SSU.DB
Then you can use blastn with your input sequences and the BLAST database:
blastn -query your_sequences.fasta -db EUK_SSU.DB -out results.out
Please refer to the BLAST+ manual or other relevant resources for more advanced usage.
5. DADA2
To perform taxonomic analysis using the DADA2 classifier and the EUKARYOME database, use the following commands:
library(dada2)
asv <- "input.fasta"
database <- "DADA2_EUK_ITS_v1.9.fasta"
taxa_output_file <- "taxonomy_result.csv"
taxonomy <- assignTaxonomy(asv,database, multithread =8)
write.csv(taxonomy, file = taxa_output_file)
Please refer to the DADA2 manual or other relevant resources for more advanced usage.