Your browser doesn't support javascript.
loading
Dataset of next-generation sequence reads of nanobody clones in phage display library derived from Indian desert camel (Camelus dromedarius L.).
Banerjee, Somesh; Singh, Ajit; Rawat, Jagveer; Bansal, Nitish; Maan, Sushila.
Affiliation
  • Banerjee S; Immunology Section, Department of Veterinary Microbiology, India.
  • Singh A; Immunology Section, Department of Veterinary Microbiology, India.
  • Rawat J; Immunology Section, Department of Veterinary Microbiology, India.
  • Bansal N; Department of Veterinary Public Helth & Epidemiology, India.
  • Maan S; Department of Animal Biotechnology, Lala Lajpat Rai University of Veterinary and Animal Sciences, Hisar, Haryana 125004, India.
Data Brief ; 34: 106663, 2021 Feb.
Article in En | MEDLINE | ID: mdl-33385028
ABSTRACT
Next-generation sequences (NGS) dataset of nanobody (Nb) clones in a phage display library (PDL) is of immense value as it serves in many different ways, such as i). estimating the library size, ii). improving selection and identification of Nbs, iii). informing about frequency of V gene families, diversity and length of CDRs, iv). high resolution analysis of natural and synthetic libraries, etc. [1], [2], [3]. We used a fraction of our previously constructed PDL of Nbs derived from an E. coli lipopolysaccharide-immunized Indian desert camel in order to obtain the dataset of NGS reads of Nbs. The cryo-preserved transformants library was revived to extract the Nb-encoding VHH (inserts)-pHEN4 (vector) DNA pool. The DNA sample was used for amplifying VHH pool by PCR [6]. The VHH amplicons band was gel-purified and subjected to NGS using Illumina MiSeqTM platform. 'Nextra XT micro V2 Index' kit was used for the Nb library DNA sample sequencing, with the adaptors 'i7' (N706 TAGGCATG) and 'i5' (S517 GCGTAAGA). The raw data comprised of a total read count of 182146 (matched= 179591; unmatched=2555), with average read length of 130.33 bases and a total of 23.74 Mb. Of 179591 matched reads, 142004 were paired reads and 37587 broken paired reads. The raw data of NGS reads was submitted to NCBI Sequence Reads Archive accessible at URL https//www.ncbi.nlm.nih.gov/Traces/study/?acc=PRJNA516512 (dataset ref. [7]), and after analysis deposited in Mendeley Datasets repository, which is accessible at URL [https//data.mendeley.com/datasets/4rsz3snvk5/3] (dataset ref. [8]). The sequence reads were analyzed by bioinformatics tools [9], [10], [11], [12]. The assembled consensus contigs revealed Nb orthologs of diverse Ag-specificities, including those isolated by conventional panning and Sanger-sequenced functional Nbs. Contig 1 CDR1-3 matched to those of anti-Trypanosoma evansi RoTat1.2 variant surface glycoprotein (VSG), while Contig 2 CDR1-3 matched to those of anti-LPS Nb clones isolated from the library. Contig 3 was however incomplete and lacked CDR3. Despite lacking the depth, the NGS data is a useful guide for selection of antigen-specific Nbs from the library, as demonstrated by anti-T. evansi VSG Nbs, and provides templates for Nb-based diagnostic reagents and therapeutic agents.
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Data Brief Year: 2021 Document type: Article Affiliation country: India

Full text: 1 Collection: 01-internacional Database: MEDLINE Language: En Journal: Data Brief Year: 2021 Document type: Article Affiliation country: India