Project - PRJNA498591

PacBio sequencing of 16S-ITS-23S rRNA operon amplicons


Amplicon sequencing of the 16S rRNA gene is the predominant method to quantify microbial compositions and to discover novel lineages. However, traditional short amplicons often do not contain enough information for confident deep phylogenetic placements. Here we present a cost-effective protocol that amplifies a large part of the rRNA operon and sequences the amplicons with PacBio technology. We tested our method on a mock community and developed a read-curation pipeline that reduces the overall read error rate to 0.18%. Applying our method on four environmental samples, we captured near full-length rRNA operon amplicons from a large diversity of prokaryotes. The method operated at moderately high-throughput (22286 - 37850 raw ccs reads). Phylogenetic trees constructed with these sequences showed an increase in statistical support compared to trees inferred with sequences similar to 250 bp amplicons of the 16S rRNA gene and identified several novel prokaryotic lineages. Our method allows users to obtain good quality, near full-length 16S and 23S rRNA gene sequences from environmental taxa and determines their phylogenetic context more confidently compared to short-read 16S rRNA gene sequencing methods.

External Links

ENA website

NCBI website


Confident phylogenetic identification of uncultured prokaryotes through long read amplicon sequencing of the 16S-ITS-23S rRNA operon
Joran Martijn, et al. 2019

Runs 5

Sample Accession Run Accession Scientific Name Instrument Platform Instrument Model Library Name
SAMN04328859 SRR8113898 marine sediment metagenome PacBio SMRT PacBio RS II PM3
SAMN08011025 SRR8113902 sediment metagenome PacBio SMRT PacBio RS II P19
SAMN10319704 SRR8113900 biofilm metagenome PacBio SMRT PacBio RS II SALA
SAMN10319748 SRR8113901 synthetic metagenome PacBio SMRT PacBio RS II MOCK
SAMN10320202 SRR8113899 marine sediment metagenome PacBio SMRT PacBio RS II TNS08