Pan-genomics: applications, challenges, and future prospects debmalya barh - The latest ebook is ava

Page 1


https://ebookmass.com/product/pan-genomics-applicationschallenges-and-future-prospects-debmalya-barh/

Instant digital products (PDF, ePub, MOBI) ready for you

Download now and discover formats that fit your needs...

Advanced Biofuel Technologies: Present Status, Challenges and Future Prospects D.K. Tuli

https://ebookmass.com/product/advanced-biofuel-technologies-presentstatus-challenges-and-future-prospects-d-k-tuli/

ebookmass.com

Environmental Sustainability of Biofuels: Prospects and Challenges Khalid Hakeem

https://ebookmass.com/product/environmental-sustainability-ofbiofuels-prospects-and-challenges-khalid-hakeem/

ebookmass.com

Graphene: Preparations, Properties, Applications, and Prospects Kazuyuki Takai

https://ebookmass.com/product/graphene-preparations-propertiesapplications-and-prospects-kazuyuki-takai/

ebookmass.com

Truly Madly Plaid Eliza Knight

https://ebookmass.com/product/truly-madly-plaid-eliza-knight-2/

ebookmass.com

A Cowboy State of Mind (Creedence Horse Rescue 1) 1st Edition Jennie Marts

https://ebookmass.com/product/a-cowboy-state-of-mind-creedence-horserescue-1-1st-edition-jennie-marts/

ebookmass.com

The Shadow Princess: A Stone Veil Novel (Chronicles of the Stone Veil Book 6) Sawyer Bennett

https://ebookmass.com/product/the-shadow-princess-a-stone-veil-novelchronicles-of-the-stone-veil-book-6-sawyer-bennett/

ebookmass.com

The Holocaust Bystander in Polish Culture, 1942-2015: The Story of Innocence Maryla Hopfinger

https://ebookmass.com/product/the-holocaust-bystander-in-polishculture-1942-2015-the-story-of-innocence-maryla-hopfinger/

ebookmass.com

Data Structures & Algorithms in Python 1st Edition John Canning

https://ebookmass.com/product/data-structures-algorithms-inpython-1st-edition-john-canning/

ebookmass.com

Burnout For Dummies 1st Edition Eva Selhub

https://ebookmass.com/product/burnout-for-dummies-1st-edition-evaselhub/

ebookmass.com

Nurse Leader Certification Review, Second Edition (Ebook PDF)

https://ebookmass.com/product/clinical-nurse-leader-certificationreview-second-edition-ebook-pdf/

ebookmass.com

AcademicPressisanimprintofElsevier

125LondonWall,LondonEC2Y5AS,UnitedKingdom

525BStreet,Suite1650,SanDiego,CA92101,UnitedStates

50HampshireStreet,5thFloor,Cambridge,MA02139,UnitedStates

TheBoulevard,LangfordLane,Kidlington,OxfordOX51GB,UnitedKingdom

©2020ElsevierInc.Allrightsreserved.

Nopartofthispublicationmaybereproducedortransmittedinanyformorbyanymeans,electronicor mechanical,includingphotocopying,recording,oranyinformationstorageandretrievalsystem,without permissioninwritingfromthepublisher.Detailsonhowtoseekpermission,furtherinformationaboutthePublisher’ s permissionspoliciesandourarrangementswithorganizationssuchastheCopyrightClearanceCenterandthe CopyrightLicensingAgency,canbefoundatourwebsite: www.elsevier.com/permissions.

ThisbookandtheindividualcontributionscontainedinitareprotectedundercopyrightbythePublisher (otherthanasmaybenotedherein).

Notices

Knowledgeandbestpracticeinthisfieldareconstantlychanging.Asnewresearchandexperiencebroadenour understanding,changesinresearchmethods,professionalpractices,ormedicaltreatmentmaybecomenecessary.

Practitionersandresearchersmustalwaysrelyontheirownexperienceandknowledgeinevaluatingandusing anyinformation,methods,compounds,orexperimentsdescribedherein.Inusingsuchinformationormethods theyshouldbemindfuloftheirownsafetyandthesafetyofothers,includingpartiesforwhomtheyhavea professionalresponsibility.

Tothefullestextentofthelaw,neitherthePublishernortheauthors,contributors,oreditors,assumeanyliabilityfor anyinjuryand/ordamagetopersonsorpropertyasamatterofproductsliability,negligenceorotherwise,orfromany useoroperationofanymethods,products,instructions,orideascontainedinthematerialherein.

LibraryofCongressCataloging-in-PublicationData

AcatalogrecordforthisbookisavailablefromtheLibraryofCongress

BritishLibraryCataloguing-in-PublicationData

AcataloguerecordforthisbookisavailablefromtheBritishLibrary

ISBN978-0-12-817076-2

ForinformationonallAcademicPresspublicationsvisit ourwebsiteat https://www.elsevier.com/books-and-journals

Publisher: StacyMasucci

AcquisitionsEditor: RafaelE.Teixeira

EditorialProjectManager: SaraPianavilla

ProductionProjectManager: MariaBernard

Designer: GregHarris

TypesetbySPiGlobal,India

Editorsbiography

SiomardeCastroSoares holdsanMScingenetics, PhDingeneticsandPhDinbioinformatics.Hewas aseniorbioinformaticsresearcherattheOfficialLaboratoryoftheMinistryofFisheries.Currently,heis workingasanassistantprofessoratFederalUniversity ofTri^anguloMineiro—UFTMandisanaffiliatememberoftheBrazilianAcademyofSciences.Hehas published79researchpublicationsand4bookchapters. Hisareasofexpertiseincludemoleculargenetics, genomicsequencing,andmicrobialcomparativegenomics,mainlyfocusedonpan-genomics,theroleof pathogenicityislandsandvirulencefactorsingenome plasticity,phylogenomics,molecularepidemiology, reversevaccinology,andsoftwaredevelopment.

SandipTiwari isapostdoctoralresearcheratthe LaboratoryofCellularandMolecularGenetics,Institute ofBiologicalSciencesInstitute,FederalUniversityof MinasGerais,Brazil.Heearnedabachelor’sdegreein microbiologyin2009fromDeenDayalUpadhyaya GorakhpurUniversity(India),amaster’sdegreein bioinformaticsin2011fromMadhyaPradeshStateUniversity(India),andadoctoratein2017inbioinformatics fromUFMG.Heworksintheareasofbioinformatics, genomics,transcriptomics,proteomics,anddrugtarget identificationagainstinfectiousdiseases.

DebmalyaBarh holdsanMScinappliedgenetics, MTechinbiotechnology,MPhilinbiotechnology, PhDinbiotechnology,PhDinbioinformatics,Postdoc inbioinformatics,andPGDMinpostgraduateinmanagement.HeisanhonoraryscientistattheInstituteof IntegrativeOmicsandAppliedBiotechnology(IIOAB), India.Dr.Barhisblendedwithbothacademicandindustrialresearchfordecadesandisanexpertinintegrative omics-basedbiomarkerdiscovery,moleculardiagnosis, andprecisionmedicineinvariouscomplexhumandiseasesandtraits.Heworkswithover400scientistsfrom morethan100organizationsacrossmorethan40countries.Dr.Barhhaspublishedover150researchpublications,morethan32bookchapters, andhaseditedmorethan20cutting-edge,omics-relatedreferencebookspublishedby Taylor&Francis,Elsevier,andSpringer.HefrequentlyreviewsarticlesforNaturepublications,Elsevier,AACRjournals,NAR,BMCjournals,PLOSONE,andFrontiers,to nameafew.Hehasbeenrecognizedby Who’sWhointheWorld and LimcaBookofRecords forhissignificantcontributionsinmanagingadvancescientificresearch.

VascoAzevedo isaseniorprofessorofgeneticsand deputyheadoftheDepartmentofGenetics,Ecology, andEvolutionatUniversidadeFederaldeMinasGerais, Brazil.HeisamemberoftheBrazilianAcademyofSciencesandisaknightoftheNationalOrderofScientific MeritoftheBrazilianMinistryofScience,Technology andInnovation.Heisalsoaresearcher1Aofthe NationalCouncilforScientificandTechnological Development(CNPq),whichisthehighestposition. ProfessorAzevedoisamoleculargeneticistwhograduatedfromveterinaryschool,FederalUniversityofBahia in1986.Heearnedhismaster’sin1989andPhDin1993 inmoleculargeneticsfromInstitutNationalAgronomiqueParis-Grignon(INAPG)and InstitutNationaldelaRechercheAgronomique(INRA),France.HeconductedapostdoctoralresearchatMicrobiologyDepartmentofMedicineSchoolfromUniversityof Pennsylvania,UnitedStates,in1994.In2017,heearnedanotherPhDinthefieldof bioinformatics.Histotalresearchpublicationsincludemorethan400researcharticles, 3books,andmorethan30bookchapters.ProfessorAzevedoisapioneerofgenetics oflacticacidbacteriaand Corynebacteriumpseudotuberculosis inBrazil.Hehasspecialized andcurrentlyresearchingonbacterialgenetics,genomes,transcriptomes,andproteomes fordevelopmentofnewvaccinesanddiagnosticagainstinfectiousdiseases.

Contributors

TalitaEmileRibeiroAdelino

Laborato ´ riodeGeneticaCelulareMolecular,ICB,UniversidadeFederaldeMinasGerais; Fundac ¸aoEzequeilDias(Funed),BeloHorizonte,Brazil

JamilAhmad

ResearchCenterforModeling&Simulation(RCMS),NationalUniversityofSciencesand Technology(NUST),Islamabad,Pakistan

ShahbazAhmed

Atta-ur-RahmanSchoolofAppliedBiosciences,NationalUniversityofSciencesand Technology,Islamabad,Pakistan

LuizCarlosJuniorAlcantara

PGPrograminBioinformatics,InstituteofBiologicalSciences,FederalUniversityofMinas Gerais(UFMG),BeloHorizonte;Laborato ´ riodeFlavivı´rus,InstitutoOswaldoCruz, Fundac ¸ a ˜ oOswaldoCruz,RiodeJaneiro;Laborato ´ riodeGeneticaCelulareMolecular,ICB, UniversidadeFederaldeMinasGerais,BeloHorizonte,Brazil

AmjadAli

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

RabiaAmir

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

FabricioAraujo

InstituteofBiologicalSciences,FederalUniversityofPara ´ (UFPA),Belem,Brazil

MuneebaArveen

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

VascoAzevedo

PGPrograminBioinformatics,InstituteofBiologicalSciences,FederalUniversityofMinas Gerais(UFMG),BeloHorizonte,Brazil

JahanzaibAzhar

Atta-ur-RahmanSchoolofAppliedBiosciences,NationalUniversityofSciencesand Technology,Islamabad,Pakistan

LucianaBalbo

StateUniversityofLondrina,Londrina,Brazil

LiBao

NationalClinicalResearchcenterforCancer,TianjinMedicalUniversityCancerInstituteand Hospital;KeyLaboratoryofCancerPreventionandTherapy,Tianjin,China

DebmalyaBarh

CentreforGenomicsandAppliedGeneTechnology,InstituteofIntegrativeOmicsandApplied Biotechnology(IIOAB),PurbaMedinipur,India

FernandaKhouriBarreto

Laborato ´ riodePatologiaExperimental,InstitutoGonc ¸aloMoniz,FiocruzBahia;Instituto MultidisciplinaremSau ´ de—IMS,UniversidadeFederaldaBahia(UFBA),Salvador,Brazil

AttyaBhatti

Atta-ur-RahmanSchoolofAppliedBiosciences,NationalUniversityofSciencesand Technology,Islamabad,Pakistan

AndreasBurkovski

Friedrich-Alexander-UniversitatErlangen-Nurnberg,Erlangen,Germany

RobertaTorresChideroli StateUniversityofLondrina,Londrina,Brazil

MauricioCorredor

GEBIOMICGroup,FCEN,UniversityofAntioquia,Medellin,Colombia

KennydaCostaPinheiro InstituteofBiologicalSciences,FederalUniversityofPara ´ (UFPA),Belem,Brazil

ArturLuizdaCostaSilva InstituteofBiologicalSciences,FederalUniversityofPara ´ (UFPA),Belem,Brazil

HamzaArshadDar

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

Letı´ciadeCastroOliveira

DepartmentofImmunology,MicrobiologyandParasitology,InstituteofBiologicalScienceand NaturalSciences,FederalUniversityofTri^anguloMineiro(UFTM),Uberaba,Brazil

SiomardeCastroSoares

DepartmentofImmunology,MicrobiologyandParasitology,InstituteofBiologicalScienceand NaturalSciences,FederalUniversityofTri^anguloMineiro(UFTM),Uberaba,Brazil

JaquelineGoesdeJesus

Laborato ´ riodeFlavivı´rus,IOC,Fundac ¸ a ˜ oOswaldoCruz,RiodeJaneiro;Laborato ´ riode PatologiaExperimental,InstitutoGonc ¸aloMoniz,FiocruzBahia,Salvador,Brazil

ThiagodeJesusSousa

PGPrograminBioinformatics,InstituteofBiologicalSciences,FederalUniversityofMinas Gerais(UFMG),BeloHorizonte,Brazil

TuliodeOliveira

KwaZulu-NatalResearchInnovationandSequencingPlatform(KRISP),CollegeofHealth Sciences,UniversityofKwaZulu-Natal,Durban,SouthAfrica

StephaneFragadeOliveiraTosta

PGPrograminBioinformatics,InstituteofBiologicalSciences,FederalUniversityofMinas Gerais(UFMG);Laborato ´ riodeGeneticaCelulareMolecular,ICB,UniversidadeFederalde MinasGerais,BeloHorizonte,Brazil

UlissesdePa ´ duaPereira

StateUniversityofLondrina,Londrina,Brazil

VagnerdeSouzaFonseca

Laborato ´ riodeFlavivı´rus,IOC,Fundac ¸ a ˜ oOswaldoCruz,RiodeJaneiro;Laborato ´ riode GeneticaCelulareMolecular,ICB,UniversidadeFederaldeMinasGerais,BeloHorizonte, Brazil;KwaZulu-NatalResearchInnovationandSequencingPlatform(KRISP),Collegeof HealthSciences,UniversityofKwaZulu-Natal,Durban,SouthAfrica

DipaliDhawan

BaylorGenetics,Houston,TX,UnitedStates

CesarToshioFacimoto

StateUniversityofLondrina,Londrina,Brazil

NunoRodriguesFaria

DepartmentofZoology,UniversityofOxford,Oxford,UnitedKingdom

NosheenFatima

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

HenriqueCesarPereiraFigueiredo

AQUACEN,NationalReferenceLaboratoryforAquaticAnimalDiseases,MinistryofFisheries andAquaculture,UniversidadeFederaldeMinasGerais,BeloHorizonte,Brazil

MartaGiovanetti

PGPrograminBioinformatics,InstituteofBiologicalSciences,FederalUniversityofMinas Gerais(UFMG),BeloHorizonte;Laborato ´ riodeFlavivı´rus,IOC,Fundac ¸ a ˜ oOswaldoCruz, RiodeJaneiro;Laborato ´ riodeGeneticaCelulareMolecular,ICB,UniversidadeFederalde MinasGerais,BeloHorizonte,Brazil

Aristo ´ telesGo ´ es-Neto

MolecularandComputationalBiologyofFungiLaboratory,DepartmentofMicrobiology, InstituteofBiologicalSciences(ICB),FederalUniversityofMinasGerais(UFMG),Belo Horizonte,Brazil

AnneCybellePintoGomide

PGPrograminBioinformatics,InstituteofBiologicalSciences,FederalUniversityofMinas Gerais(UFMG),BeloHorizonte,Brazil

LuisCarlosGuimara˜es

InstituteofBiologicalSciences,FederalUniversityofPara ´ (UFPA),Belem,Brazil

RaquelEnmaHurtado

PGPrograminBioinformatics,InstituteofBiologicalSciences,FederalUniversityofMinas Gerais(UFMG),BeloHorizonte,Brazil

FelipeCamposMeloIani

Laborato ´ riodeGeneticaCelulareMolecular,ICB,UniversidadeFederaldeMinasGerais; Fundac ¸ a ˜ oEzequeilDias(Funed),BeloHorizonte,Brazil

IzabelaCoimbraIbraim

PGPrograminBioinformatics,InstituteofBiologicalSciences,FederalUniversityofMinas Gerais(UFMG),BeloHorizonte,Brazil

MadangchanokImchen

DepartmentofGenomicScience,SchoolofBiologicalSciences,CentralUniversityofKerala, Kasaragod,India

ArunKumarJaiswal

DepartmentofImmunology,MicrobiologyandParasitology,InstituteofBiologicalScienceand NaturalSciences,FederalUniversityofTri^anguloMineiro(UFTM),Uberaba;PGProgramin Bioinformatics,InstituteofBiologicalSciences,FederalUniversityofMinasGerais(UFMG), BeloHorizonte,Brazil

SyedBabarJamal

DepartmentofBiologicalSciences,NationalUniversityofMedicalSciences,Rawalpindi, Pakistan

PeterJohn

Atta-ur-RahmanSchoolofAppliedBiosciences,NationalUniversityofSciencesand Technology,Islamabad,Pakistan

RodrigoBentesKato

MolecularandComputationalBiologyofFungiLaboratory,DepartmentofMicrobiology, InstituteofBiologicalSciences(ICB),FederalUniversityofMinasGerais(UFMG),Belo Horizonte,Brazil

JaspreetKaur

UniversityInstituteofEngineeringandTechnology(UIET),DepartmentofBiotechnology, PanjabUniversity,Chandigarh,India

BineypreetKaur

UniversityInstituteofengineeringandTechnology(UIET),DepartmentofBiotechnology, PanjabUniversity,Chandigarh,India

RanjithKumavath

DepartmentofGenomicScience,SchoolofBiologicalSciences,CentralUniversityofKerala, Kasaragod,India

XiaofengLiu

NationalClinicalResearchcenterforCancer,TianjinMedicalUniversityCancerInstituteand Hospital;KeyLaboratoryofCancerPreventionandTherapy,Tianjin,China

NguyenThanhLuan

DepartmentofVeterinaryMedicine,InstituteofAppliedScience,HoChiMinhCityUniversity ofTechnology—HUTECH,HoChiMinhCity,Vietnam

WajahatMaqsood

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

WandersonMarquesdaSilva

InstituteofAgrobiotechnologyandMolecularBiology,INTA-CONICET,BuenosAires, Argentina

AnupriyaMinhas

UniversityInstituteofengineeringandTechnology(UIET),DepartmentofBiotechnology, PanjabUniversity,Chandigarh,India

FaizaMunir

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

AmaliaMun ˜ oz-Go ´ mez

GEBIOMICGroup,FCEN,UniversityofAntioquia,Medellin,Colombia

KanwalNaz

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

AnamNaz

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

AyeshaObaid

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

YanPantoja

InstituteofBiologicalSciences,FederalUniversityofPara ´ (UFPA),Belem,Brazil

RommelRamos

InstituteofBiologicalSciences,FederalUniversityofPara ´ (UFPA),Belem,Brazil

NoorUlSaba

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

AlvaroSalgado

Laborato ´ riodeGeneticaCelulareMolecular,ICB,UniversidadeFederaldeMinasGerais,Belo Horizonte,Brazil

VartulSangal

FacultyofHealthandLifeSciences,NorthumbriaUniversity,NewcastleuponTyne,United Kingdom

Qurat-ul-AinSani

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

NubiaSeyffert

BiologyInstitute,FederalUniversityofBahia,Salvador,Brazil

FaisalSherazShah

Atta-ur-RahmanSchoolofAppliedBiosciences,NationalUniversityofSciencesand Technology,Islamabad,Pakistan

FatimaShahid

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

MuhammadShehroz

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

AmnahSiddiqa

ResearchCenterforModeling&Simulation(RCMS),NationalUniversityofSciencesand Technology(NUST),Islamabad,Pakistan

GyanP.Srivastava

MolecularBiologyandGeneticEngineeringLaboratory,DepartmentofBotany,Universityof Allahabad,Allahabad,India

GuilhermeCamposTavares UniversidadeNiltonLins,Manaus,Brazil

HaiHaPhamThi

FacultyofBiotechnologyandEnvironmentalTechnology,NguyenTatThanhUniversity, HoChiMinhCity,Vietnam

SandeepTiwari

PGPrograminBioinformatics,InstituteofBiologicalSciences,FederalUniversityofMinas Gerais(UFMG),BeloHorizonte,Brazil

BasantK.Tiwary

CentreforBioinformatics,PondicherryUniversity,Pondicherry,India

NimatUllah

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

RavaliKrishnaVennapu

DepartmentofGenomicScience,SchoolofBiologicalSciences,CentralUniversityofKerala, Kasaragod,India

JoilsonXavier

Laborato ´ riodeFlavivı´rus,IOC,Fundac ¸ a ˜ oOswaldoCruz,RiodeJaneiro;Laborato ´ riode

PatologiaExperimental,InstitutoGonc ¸aloMoniz,FiocruzBahia,Salvador,Brazil

NeelamYadav

MolecularBiologyandGeneticEngineeringLaboratory,DepartmentofBotany,Universityof Allahabad,Allahabad,India

BhupendraN.S.Yadav

MolecularBiologyandGeneticEngineeringLaboratory,DepartmentofBotany,Universityof Allahabad,Allahabad,India

RajivK.Yadav

MolecularBiologyandGeneticEngineeringLaboratory,DepartmentofBotany,Universityof Allahabad,Allahabad,India

DineshK.Yadav

MolecularBiologyandGeneticEngineeringLaboratory,DepartmentofBotany,Universityof Allahabad,Allahabad,India

TahreemZaheer

DepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB), NationalUniversityofSciencesandTechnology(NUST),Islamabad,Pakistan

PAN-GENOMICS: APPLICATIONS, CHALLENGES,AND FUTURE PROSPECTS

PAN-GENOMICS: APPLICATIONS, CHALLENGES,AND FUTUREPROSPECTS

DEBMALYABARH,PhD

Scientist,CentreforGenomicsandAppliedGeneTechnology,Instituteof IntegrativeOmicsandAppliedBiotechnology(IIOAB) Nonakuri,India

SIOMARSOARES,PhD

AssistantProfessoratDepartmentofImmunology,Microbiology andParasitology,InstituteofBiologicalSciencesandNatural Sciences,FederalUniversityofTrianguloMineiro(UFTM) Uberaba,Brazil

SANDEEPTIWARI,PhD

Post-DoctoralResearcher,LaboratoryofCellularandMolecularGenetics, FederalUniversityofMinasGerais(UFMG) BeloHorizonte,Brazil

VASCOAZEVEDO,PhD

SeniorProfessor,InstituteofBiologicalSciences,FederalUniversityofMinas Gerais(UFMG)

BeloHorizonte,Brazil

Preface

Sincethedevelopmentofthenext-generationsequencingtechnologies,manygenomes havebeendepositedinthedatabasesand,asaresult,thetermpan-genomewascoinedin 2005todescribeanewareaofgenomicsanalysesthatusedseveralstrainsofthesame speciestogaininsightsintothedevelopmentofbacterialgenomes.Thisareahasthen expanded,andnowotherapplicationshaveappearedtocomplementthepan-genomics, creatingthepan-omicsanalyses.Thisbookwasconceivedtobeacompendiumofpangenomicsandotherpan-omicsanalysesfromdifferentorganisms.

Thebook Pan-Genomics:Applications,Challenges,andFutureProspects beginswithan introductiononpan-omicsfocusedtoCrick’sCentralDogmaandabriefdescriptionof allthechaptersofthebook(Chapter1),inwhichsomebasicconceptsofpan-genomics areintroduced. Chapter2,ontheotherhand,discussestheuseofbioinformatics approachesappliedtopan-genomicsandtheirchallenges,withalistofsoftwarethat maybeusefulinthiscontext.In Chapter3,Dr.Tiwaryandcollaboratorsdiscussthe useofpan-genomicsinevolutionarystudiesbasedongenecontentandsinglenucleotide polymorphism.

Next,thechaptersexplorethepan-genomicsofmodelbacterialorganismsandits applicationsuchasinthediscoveryofvaccineanddrugtargetsagainstbacterialpathogens usingreversevaccinologyanddrugtargetanalyses(Chapter16). Chapter4 describesthe pan-genomicsanalysesof Corynebacteriumdiphtheriae and Corynebacteriumulcerans,the causativeagentsofdiphtheriaanddiphtheria-likediseases. Chapter5 describestheuse ofpan-genomicsinveterinarypathogens,focusingonthepan-genomeanalysisof Corynebacteriumpseudotuberculosis,thecausativeagentofCaseouslymphadenitisinsmallruminants.In Chapter6,Dr.Amirexploresthepan-genomesofplantpathogens,focusingon Pectobacteriumparmentieri, Pantoeaananatis, Erwiniaamylovora, Burkholderia, Xylellafastidiosa, Pucciniagraminis, and Zymoseptoriatritici.In Chapter7,Dr.Pereiraexploresthe pan-genomeoffoodpathogenssuchas Escherichiacoli, Salmonellaenterica, Clostridiumbotulinum, Clostridiumperfringens, Listeriamonocytogenes, and Staphylococcusaureus Chapter8 focusesonthepan-genomeofaquaticanimalssuchas Edwardsiella and Aeromonas.In Chapter9,Dr.Aliexploresthepan-genomesofmodelbacteriasuchas Streptococcusagalactiae, Neisseriameningitidis, Staphylococcusaureus, E.coli, Streptococcuspyogenes, Haemophilus influenzae, and Streptococcuspneumoniae.Finally,in Chapter10,thepan-genomeof multidrug-resistanthumanpathogenicbacteriaandtheirresistomearediscussed,focusingonbacteriasuchas Acinetobacterbaumannii and Pseudomonasaeruginosa

Otherchaptersfocusonvirus,plants,algae,fungi,andhumansinpan-canceranalyses. Chapter11 focusesonthepan-genomicsofvirusanditsapplicationstoprovideinsights

intothetransmission,biology,andepidemiologyofhealth-care-associatedviruspathogens,andalsoprovideadescriptionofsoftwareusedforthistask.In Chapter12, Go ´ es-Netoperformsanintensiveliteraturereviewandmetaanalysisofacustomized databasetoprovideinsightsintofunguspan-genomics,withdataonthemoststudied fungiofthe12moreexploredgenera.In Chapter13,Dr.Kaurdescribesthestateof theartinthegenomicsofalgaeorganisms,frommicrotomacroalgae. Chapter14 describesthepan-genomeofplantsanditsapplications,focusingon Brassicarapa, Brassica oleracea, Glycinesoja, Oryzasativa, and Brachypodiumdistachyon Chapter15 describesthe pan-cancerproject,whichmaybehelpfulincancerpreventionandinthedesignofnew cancertherapeutics.

Pan-omicsanalysesarefurtherdescribedinchaptersdedicatedtopan-proteomics, pan-metagenomics,pan-metabolomics,pan-interactomics,andpan-transcriptomics. Pan-metagenomicsisexploredin Chapter17 tobetterunderstandthemicrobiotaofa givenorganismorecosystemindifferentconditionsandalsotoexplorethecommonly sharedmicroorganismsintheseconditions.Theauthorsalsodiscusstheimportanceof pan-metagenomicsinpharmacokinetics.Pan-transcriptomics(Chapter18)andpanproteomics(Chapter19)areintendedtoanalyzethedatasetofdifferentiallyexpressed andcommonlyexpressedgenesindifferentconditionsinordertogiveinsightsintoadaptationtotheseconditions. Chapters20and21 explorepan-metabolomicsandpaninteractomics,respectively,whicharerecentareasofresearchandmayhelpinelucidating differentiallyregulatedmetabolicpathwaysandprotein-proteininteractionsindifferent conditions.

Atotalof65expertsfrom14countrieshavecontributedtothisbooktocoverwide areasofpan-genomics.Webelievethisbookwillprovidethereaderswiththemainstrategiesandtheirapplicationsutilizedsofarinpan-genomics.

Editors

DebmalyaBarh SiomarSoares

SandeepTiwari

VascoAzevedo

Pan-omicsfocusedtoCrick’scentral dogma

ArunKumarJaiswal*ab,SandeepTiwari*a,GuilhermeCamposTavaresh, WandersonMarquesdaSilvad,LetíciadeCastroOliveirab,IzabelaCoimbra Ibraima,LuisCarlosGuimarãese,AnneCybellePintoGomidea,SyedBabarJamalc, YanPantojae,BasantK.Tiwaryi,AndreasBurkovskij,FaizaMunirk,HaiHaPhamThil, NimatUllahk,AmjadAlik,MartaGiovanettia,m,LuizCarlosJuniorAlcantaraa,m, JaspreetKaurn,DipaliDhawano,MadangchanokImchenp, RavaliKrishnaVennapup,RanjithKumavathp,MauricioCorredorq,HenriqueCesar PereiraFigueiredog,DebmalyaBarhf,VascoAzevedoa,SiomardeCastroSoaresb aPGPrograminBioinformatics,InstituteofBiologicalSciences,FederalUniversityofMinasGerais(UFMG),BeloHorizonte,Brazil bDepartmentofImmunology,MicrobiologyandParasitology,InstituteofBiologicalScienceandNaturalSciences, FederalUniversityofTri^anguloMineiro(UFTM),Uberaba,Brazil

cDepartmentofBiologicalSciences,NationalUniversityofMedicalSciences,Rawalpindi,Pakistan dInstituteofAgrobiotechnologyandMolecularBiology,INTA-CONICET,BuenosAires,Argentina eInstituteofBiologicalSciences,FederalUniversityofPara ´ (UFPA),Belem,Brazil fCentreforGenomicsandAppliedGeneTechnology,InstituteofIntegrativeOmicsandAppliedBiotechnology(IIOAB), PurbaMedinipur,India

gAQUACEN,NationalReferenceLaboratoryforAquaticAnimalDiseases,MinistryofFisheriesandAquaculture, UniversidadeFederaldeMinasGerais,BeloHorizonte,Brazil hUniversidadeNiltonLins,Manaus,Brazil iCentreforBioinformatics,PondicherryUniversity,Pondicherry,India jFriedrich-Alexander-UniversitatErlangen-Nurnberg,Erlangen,Germany kDepartmentofPlantBiotechnology,Atta-ur-RahmanSchoolofAppliedBiosciences(ASAB),NationalUniversityofSciences andTechnology(NUST),Islamabad,Pakistan lFacultyofBiotechnologyandEnvironmentalTechnology,NguyenTatThanhUniversity,HoChiMinhCity,Vietnam mLaborato ´ riodeFlavivı´rus,IOC,Fundac ¸aoOswaldoCruz,RiodeJaneiro,Brazil, nUniversityInstituteofEngineeringandTechnology(UIET),DepartmentofBiotechnology,PanjabUniversity, Chandigarh,India oBaylorGenetics,Houston,TX,UnitedStates pDepartmentofGenomicScience,SchoolofBiologicalSciences,CentralUniversityofKerala,Kasaragod,India qGEBIOMICGroup,FCEN,UniversityofAntioquia,Medellin,Colombia

1Introduction

SincethedevelopmentofthefirstDNAsequencingtechnologies,manyorganismshad theircompleteDNArepertoiresequencedbySangerandnext-generationsequencing (NGS)technologies,creatingtheareaofgenomics,whichwasoriginatedbythefusion ofthewordsgeneandchromosome [1].Inthisscenario,agenomeisthecompletedataset ofgenesofagivenorganism.Nowadays,therearemorethan200,000genomeprojects registeredattheGenomeOnlineDatabase(GOLD),whereasmorethan120,000are

* Theseauthorscontributedequallytothiswork.

Pan-genomics:Applications,Challenges,andFutureProspects

2020ElsevierInc. https://doi.org/10.1016/B978-0-12-817076-2.00001-9

2 Pan-genomics:Applications,challenges,andfutureprospects

genomesisolatedfrombacteria(https://gold.jgi.doe.gov/statistics).Bacteriaarewidely distributedallovertheworldandhaveimplicationsinhealth,agriculture,industry,and others.Besides,theirgenomesaresmall,highlycompact,anddonotpresentmanyrepetitions,makingthemgoodtargetsforgenomesequencing,oncetheirgenomesareeasier tosequencethantheonesfromotherorganisms.Also,fromthegenomesequenceof bacteria,itispossibletofindvirulencefactors,antibioticresistancegenes,newtherapeutictargetsforvaccineanddrugdevelopment,andindustriallyimportantgenes [2,3].

AnotherimportantpointofthedevelopmentofNGStechnologieswasthegenome sequencingprocessthathasbecomecheaperandfaster,makingitpossibleforsmalllaboratoriestousethetechnologyindailyroutine.NGSmadepossiblethecomparisonofseveralgenomesinamultiprongedstrategy,wherephylogenomics,genomeplasticity,and wholegenomesyntenyanalysesareeasiertoperformnowadays(Fig.1).Also,RNA sequencing(RNA-seq)bytheseplatformsandthedevelopmentofnewtechnologies forsequencingthecompletedatasetofproteinsofanorganismcreatedtheareasoftranscriptomicsandproteomics,respectively [4,5].Altogether,genomicsisresponsibleforthe identificationofthecompletedatasetofgenesofagivenorganism,whereastranscriptomics andproteomicsareimportantfortheidentificationofgenesthataredifferentiallyexpressed

Fig.1 Pan-omicsanditsapplications.

betweenstrainsorspecies.Finally,theeffortstocompareseveralgenomesatoncecreated theareaofpan-genomics,whichwillbefurtherdiscussedinthisbook.

1.1Briefoverviewofpan-genomics

Thetermpan-genomicswascreatedbyTettelinandcollaborators,in2005 [6],to describethecompletedatasetofgenesofagivenspeciesthroughthesequencingofseveralstrainsofthisspecies.Thepan-genomeiscomposedofthecoregenome,shared genome,andsingletonssubsets,whereasthecoregenomeiscomposedofallthecommonlysharedgenesbyallstrainsofthespecies;thesharedgenomecontainsgenesthatare presentintwoormore,butnotallstrainsfromaspecies;andthesingletonsarestrainspecificgenes(Fig.2).Fromthesesubsets,onecanextrapolatethedatatofindvaccines anddrugtargetsfromthecoregenome,whereasthesharedgenesandsingletonsare responsiblefordifferencesbetweenthestrainsthatarenormallyresponsiblefortheemergenceofnewpathogensandtheadaptationtonewtraits [6–10]

Normally,thecoregenomeiscomposedofhousekeepinggenesandothergenes importantformetabolismandotherimportantfunctionsoftheorganism,whereasthe sharedgenesandsingletonsaretheresultofgenomeplasticity.Genomeplasticityis thedynamicpropertyofDNAwhichinvolvesthegain,loss,andrearrangementofgenes throughplasmids,phages,andgenomicislands(GEIs).GEIsarehugeblocksofgenes acquiredthroughhorizontalgenetransfer(HGT)thatnormallyshareafunctionincommon.Theyareclassifiedaccordingtothefunctionsofthegenesinto:pathogenicity islands,harboringvirulencefactors;metabolicislands,composedofmetabolism-related genes;resistanceislands,withantibioticresistantgenes;andsymbioticislands,which shareincommonthepresenceofsymbiotic-relatedgenes [11,12]

Normally,thesubsetsofthepan-genomeareidentifiedbytheuseoforthologyanalyses,whichfirstidentifyallorthologousgenesfromthecompletedatasetusingall-vs-all blastsorotheralignmentsearchtools.Next,thedatasetsareclassifiedaccordingtotheir homologytogenesfromotherstrainsinthesubsets.Aftertheclassification,thedatais plottedinachartandmathematicalformulasareusedtofitthespecificcurves.Twosuch

Fig.2 Schematicrepresentationofthecoregenome,sharedgenome,andsingletonsubsetsofpangenomeanalysis.

formulasareHeaps’lawforthepan-genomedevelopmentandleast-squaresfitofthe exponentialregressiondecayforthecoregenomeandsingletonsubsets,whichare describedrespectivelyas: n ¼ k N α,where n isthenumberofgenes, N isthenumber ofgenomes,and k and α areconstantsdefinedbytheformula;and n ¼ k e x/τ + tgθ , where n isthenumberofgenes, x isthenumberofgenomes, e isEuler’snumber, and k, τ,and tgθ areconstantsdefinedbytheformula [6,9].

1.2Openandclosedpan-genomes

AccordingtoHeap’slaw,the α valueisrepresentativeofthecurrentdynamicsofthepangenome,wherean α higherthan1isrepresentativeofaclosedpan-genomeandan α lowerthan1representsanopenpan-genome.Aclosedpan-genomehasallpossiblegenes representedandonlyfewgeneswillbeaddedtothepan-genomeifmoregenomesareto besequenced,whereasanopenpan-genomeisstillnotfullyrepresentedandthesequencingofnewgenomeswilladdmanygenestotheanalyses [6,9].Thisdefinitioniscontroversial,however,oncetheincorporationofGEIsmaychangethecompositionofthe pan-genomedrastically,evenforclosedpan-genomes,takingittobeopenagain.Most important,environmentalbacteriaandextracellularpathogensnormallyhaveopenpangenomes,oncetheystillneedtoadapttonewtraits,whereasobligateintracellularpathogenstendtohaveclosedpan-genomesoncetheyarenotinconstantcontactwithother bacteria.Also,intracellularpathogenshavelostmanygenesduringevolution,completely adaptingtothehostorganismand,thus,presentverycompactgenomeswithahighpercentageofessentialgenes [13]

Accordingtoleast-squaresfitoftheexponentialregressiondecay,the tgθ isrepresentative ofthenumberofgenespresentinthecoregenomeafterstabilizationofthecoregenome curveand,also,ofthenumberofgenesthatwillbeaddedtotheanalysesafteranewgenome issequencedfromthesingletondevelopmentcurve.Basedonthat,researchersmaychoose thespeciesthatneedmorestrainstobesequencedandwhichdonot.Finally,thehighestthe tgθ onthesingletondevelopment,thelowerthe α,onceahighnumberofgeneswillbeadded totheanalysestakingthepan-genometobemoreopenandthe α tobelower(Fig.3).The oppositeisalsotrue,thelowerthe tgθ ,thehigheristhe α value [6,7,10].

1.3Computationalmethodsusedinpan-genomics

Computationalmethodstofindmoreefficientdatastructures,algorithms,andstatistical methodstoperformbioinformaticsanalysesofpan-genomeshavebeenstudiedbecauseit isknownthatinapan-genomeanalysisthegreaterthenumberofgenomestakentothe analysisthegreaterwillbethecomputationalcosts,thatis,thediscoveryofapan-genome contentisanNP-hardproblembecausecomparisonsbetweenallsetsofgenesarenecessarytosolvethetask.Furthermore,inanefforttocomputestandardizedpan-genome analysisandminimizecomputationalchallenges,severalonlinetoolsandsoftwaresuites

(q )=69±9

tg(q )=4±1.5

Fig.3 Theconceptofopenandclosepan-genome.

havebeendeveloped.Examplesofsuchapplicationsare:PGAP [14],oneofthemost completeprofileavailableforperformingfiveanalysismodules,buttheruntimeofthe analysisgrowapproximatelyquadraticallywiththesizeofinputdataandarecomputationallyinfeasiblewithlargedatasets.ThesoftwareRoary [15] andBPGA [16] was createdtoaddressthecomputationalissuesrelatedtoperformanceandexecutiontime. Roaryperformsarapidclusteringofhighlysimilarsequences,whichcanreducethe runtimeofBLAST.BPGAisanultrafastcomputationalpipelinewithsevenfunctional modulesforcomprehensivepan-genomestudiesanddownstreamanalyses.Pan-genome analysiscanbeappliedinmanydifferentapplicationdomains,suchasmicrobes,metagenomics,viruses,plants,cancer,andothers [17].Nowadays,theprocessesofsimilarity searchandpan-genomevisualizationaretwoofthewidevarietyofparticularcomputationalchallengesthatneedtobeconsidered.Forthis,noveldifferentcomputational methodsandparadigmsareneededovertheyears,makingthecomputationalpangenomicsasubareaofresearchinrapidextension.Furthermore,newtechnologiesthat areemerginginrapiddevelopmentallowtoinferthepan-genomewiththreedimensionalconformation,whichmeansthatpossiblyinthefuturethree-dimensional pan-genomeswillnotonlyrepresentallsequencevariationofthespeciesorgenus, butwillalsoencodetheirspatialorganization,aswellastheirmutualrelationshipsin thisregard.

1.4Applicationsofpan-genomicsinevolutionarystudies

Themanifestationofrichgeneticdiversityintheformofapan-genomeinaspeciesisan evolutionarypuzzle.Thesethreedistinctpartsofapan-genome(core,shared,andsingletons)ofaparticularspeciesmayundergodifferentevolutionarytrajectoriesunderthe differentialinfluenceofevolutionaryforces.Anidealpan-genomeisexpectedtobevery complete,comprehensive,efficient,andstable [18].Thepan-genomeofaspecieshas someevolutionarysignaturesintheformofgenecontentandsinglenucleotide a <1 ® Open pan-genome a >1 ® Closed pan-genome Openpan-genome

polymorphism(SNP).Theseevolutionarysignaturesareusefulininferringthephylogeneticrelationshipamongdifferentstrainsofaspeciesbasedonthepan-genome.

Anevolutionarypan-genomicstudyofmicrobesprovidesaholisticpictureofallthe genomicvariationsofaspecies.Thesegenomicvariationsendowthebacteriawiththeir uniquepathogenicpropertiesandsubsequentdevelopmentofresistancetovariousantibiotics.Thus,acompletemechanisticdetailoftheprocessesinvolvedinthepathogenesis andfrequentantibioticresistanceinabacteriumwillfurtherpavethewayforbetter detectionmethodsandeffectivecontrolstrategiesforthepathogen.Inaddition,evolutionarypan-genomicsofausefulbacteriumwillhelpusinexploitingmaximallythefull potentialofthemicrobeinenhancingindustrialproductivity.Infact,itwillbeaboomfor theindustriesactivelyinvolvedintheproductionofpharmaceuticalsanddairyproducts usingmicrobialcultures.EukaryotesincludingcropplantsandfarmanimalshaveabundantgenomicvariationsintheformofSNP,copynumbervariants(CNVs),andpresence/absencevariants(PAVs).ThediscoveryofSNPsassociatedwithproductivityor diseaseresistanceinacroporafarmanimalwillbemuchmoreefficientwiththeavailabilityofacompletepan-genomeofthespecies [19].

Inarecentpast,aworkpublishedbyBenevidesetal. [20] utilized16SrRNAgene phylogeny,whole-genomemultilocussequencetyping(wgMLST),phylogenomics, genesynteny,averagenucleotideidentity(ANI),andpan-genometoexplainthephylogeneticrelationshipsinabetterwayamongstrainsof Faecalibacterium.Forthis,theyused 12newlysequenced,assembled,andcuratedgenomesof Faecalibacteriumprausnitzii, whichwereisolatedfromthefecesofhealthyvolunteersfromFranceandAustralia, andcombinedthesewithfivestrainsalreadypublished,whichweredownloadedfrom publicdatabases.Thephylogeneticanalysisofthe16SrRNAalongwiththewgMLST profileandthephylogenetictreebasedonthecomparisonofthesimilarityofgenome supportsthegroupingof Faecalibacterium strainsindifferentgenospecies [20].

InanotherworkpublishedbyChenetal. [21],thecomparisonofwholegenomeand coregenomemultilocussequencetyping(MLST)andSNPanalyseswerecarriedoutto showthemaximumbiasedpowerachievedbyusingmultipleanalyses.Itwasrequiredto differentiateisolatesassociatedwithoutbreakfromapulsed-fieldgelelectrophoresis (PFGE)-indistinguishableisolatecollectedin2012fromanonimplicatedfoodsource. Wholegenomesequencing(WGS)hasbeenprovenasapowerfulsubtypingtoolforbacterialike L.monocytogenes,afoodbornepathogen [21].Acompanyproducedanenvironmentalisolatethatwashighlysimilartoalloutbreakisolates.Thedifferenceobserved betweenunrelatedisolatesandoutbreakisolateswasonly7–14SNPs;consequently, theminimumspanningtreefromtheanalysesofwholegenome,phylogeneticalgorithm, andusualvariantcallingapproachforcoregenome-basedanalysescouldnotofferthe differencebetweenunrelatedisolates.ThisalsosuggestedthattheSNP/allelecounts shouldalwaysbepooledwithWGSclusteringanalysisproducedbyphylogenetically meaningfulalgorithmsonanadequatenumberofisolates,andtheSNP/allele

onsetalonedoesnotprovideenoughevidencetodemarcateanoutbreak [21].Hence,it wasproposedthatthecomparisonofpan-genomesubcategoriesandtheirrelated α value maybeutilizedasanalternateapproach,alongwithANI,intheinsilicocatalogingof newspecies [20,22].Wehopethattheever-expandingpan-genomeacrossdifferentspeciesandgenerawillgiveimpetustoabetterdatastructureofthepan-genomeandnovel computationalmethodsforarobustevolutionarypan-genomicanalysisinnearfuture.

2ApplicationsofPan-genomicsinBacteria

2.1Applicationsofpan-genomicsinmodelbacteria

Advancementinsequencingtechnologiesanddevelopmentinsophisticatedbioinformaticstoolscreatedanoverwhelmingnumberofmicrobialgenomicdataandallowed thescientificcommunitytoestimatethepan-genomeofaspecies.Identificationofnovel dispensablegeneshasapplicationsincharacterizingnovelmetabolicpathways,virulence determinants,andmolecularfingerprintingtargetsforepidemiologicalstudiesandcore genescanbeusedtopredicttheevolutionaryhistoryoftheorganism [9].Therefore,pangenomeanalysesarenowconsideredtheindispensableandgoldstandardforbacterial genomecomparisons,evolution,anddiversity.Itisalsousefultodevelopavaccine againstthepathogensofepidemicdiseasesbyfilteringdifferentfunctionalgenesinthe coregenomeusingreversevaccinologyapproaches [23]

Thereareanumberoffreelyaccessibletools,pipelines,andweb-serversavailableto estimatethemicrobialpan-genomeincludingRoary,BPGA,PGAP,PGAPx,Panseq, PanOCT,etc. [16].Anumberofmodelbacterialspeciespan-genomeisdetermined byresearchersandavastmajorityofthosehumanpathogensexhibitanopenpangenome,astheycolonizemultipleenvironmentsthatfacilitatethemtoexchangegenetic materials.Theseorganismsinclude Escherichiacoli, Meningococci, Streptococci, Salmonellae, Helicobacterpylori,etc. [24].Therefore,indealingwithsuchspeciesareasonablenumber ofgenomesisusuallyrequiredtodefinethecompletegenerepertoireofthesespecies.On theotherhand,specieslivinginisolated(close)habitatshavinglesspossibilitytoexchange geneticmaterialtendtohaveclosedpan-genome,forexample, Mycobacteriumtuberculosis, B.anthracis,and Chlamydiatrachomatis [25].Hence,pan-genomeanalysesserveasaframeworktodetermineandunderstandthegenomicdiversityinbacterialspecies.In Chapter17,wehavediscussedthebacterialpan-genomeanalysisperformedtilldatewith specificexamplesfrommodelorganismsalongwithstudyingapproaches,technical implementations,andtheiroutcome.

2.2Applicationsofpan-genomicsin Corynebacteriumdiphtheriae and Corynebacteriumulcerans

Thedevelopmentofdiphtheriatoxoidvaccinesinthe1920s,thestartofmassimmunizationinthe1940s,andtheglobalintroductionoftheExpandedProgramon

Immunization(EPI)bytheWorldHealthOrganization(WHO)in1974ledtoadramaticdecreaseofdiphtheriacases,bothinindustrializedanddevelopingcountries [26].However,despitethistremendoussuccessstory,diphtheriahasnotbeeneradicated yet.Thishasbeenillustrateddramaticallybyadiphtheriapandemicconnectedtothe breakdownoftheformerUnionofSocialistSovietRepublicswithmorethan 157,000casesandmorethan5000deathsreportedbetween1990and1998.Evenafter thepandemichasfinallystopped,localbreakoutshavebeenobservedconstantlyduring thelastyearsandthereportedglobalcasesincreasedfromabout7000in2016toalmost 9000in2017withafocusoncountrieswithlimitedorlackingpublichealthsystems,for exampleIndia,Indonesia,Nepal,Pakistan,Venezuela,andYemen.Consequently, Corynebacteriumdiphtheriae,theetiologicalagentofrespiratoryandcutaneousdiphtheria,isstill presentonthelistofthemostimportantglobalpathogens [27].Furthermore,thefrequencyofhumandiphtheria-likeinfectionsassociatedwith Corynebacteriumulcerans appearstobeincreasing [28].Thisspecies,whichwasrecognizedbeforeasacommensal ofalargenumberofanimalspecies,iscloselyrelatedto C.diphtheriae andrecognizedasan emergingpathogentoday [28,29].

Theneedoffastandunequivocalidentificationofespeciallypathogenic C.diphtheriae ledtotheearlydevelopmentofanumberofdifferentmethodssuchasbiovardiscriminationbasedondifferentbiochemicalreactions,Elek’stesttoimmunologicallydistinguishbetweentoxigenicandnontoxigenicstrains,restrictionfragmentlength polymorphism(RFLP),single-strandconformationpolymorphism(SSCP),phagetyping,spoligotyping,ribotyping,MLSTandothers.Thisplethoraofmethodswassignificantlyimprovedwhennext-generationsequencingwasintroduced.Thefirstgenome sequenceof C.diphtheriae waspublishedin2003andshowedthepresenceofthe tox gene onabacteriophageinadditiontoanumberofotherhorizontallyacquiredvirulenceassociatedgenes [30].Subsequentpan-genomestudiesallowedunravelingtheextent ofgenomicdiversitywithin C.diphtheriae andtheroleofHGTasasourceofvariation betweenstrains.Furthermore,pan-genomicsof C.ulcerans helpedtoestimatethevirulencepotentialofdifferentstrainsandtoverifyzoonotictransmissionfromanimalsto patients.Today,pan-genomicsof C.diphtheriae and C.ulcerans allowelucidatingglobal transmissiontraitsandlocaladaptationsofpathogeniccorynebacteriaand,hopefully,a betterunderstandingofpopulationdynamicsandstrainevolutionwillhelpcombatdiphtheriaandother Corynebacterium-associateddiseasesinfuture.

2.3Applicationsofpan-genomicsinmultidrug-resistanthuman pathogenicbacteriaandpan-resistome

Thepan-genomewillprobablybethelargestmolecularevolutionaryhistoryofthe organismeverwritten.Thiswillintegrateallthepan-phenotypesexistingonEarth,such asthepan-proteome,thepan-transcriptome,andespecially,aportionofpan-genome thathasmadetheorganismssuccessfulonEarth:thepan-resistome.Thepan-genome

representsthesetofallcurrentgenesinthegenomesofagroupoforganisms.Thebasic genomecommontoallbacteriacontainsabout250genefamiliesintheextended core,thespecificnicheadaptivegenomeofabout8000genefamiliesinthe charactergenepool,andthepan-genomicdiversity(accessorygenes)ofmorethan 139,000raregenefamiliesscatteredthroughoutthebacterialgenomes [31].Thepangenomeanalysis,wherebythesizeofthegenerepertoireaccessibletoanygivenspecies ischaracterizedalongwithanestimateofthenumberofwholegenomesequences requiredtoproperanalysis,andcurrentlyitisincreasing10yearsafterTettelinetal. [6] publication.Differentcurrentmodelsforthepan-genomeanalysis,accuracy,and applicabilitydependonthecaseathand [32].TheNCBI,EMBL,KEEG,PATRIC, MBGD,ENSEMBL,andJGI-IMG/Mdatabasesprovidecompletedownloadablegenomicsinformation,whichcanbeanalyzedforintraspeciesdiversity,anddeterminethe pan-genomeusingsoftwaretools,currentlydevelopedtoperformviaapersonalserver [32],orevenonlineresources.Thepan-genomicsisnowacuttingedgeofcomputational genomicsfield.Pan-genomicsisasubareaofcomputationalbiology [17].Therefore,the notionofcomputationalpan-genomicsintentionallypassesthroughmanyother bioinformatics-relateddisciplines.

Theresistome,atermcoinedbyWright [33],comprisesallthegenesandtheirproductsthatcontributetoresistwhateverenvironment,substance,orsomeextremegrow factor.Updateddatawillclosetothemetadataavailableforestablishingwhatpartofresistometraitsbelongbothtocore-genomeasaccessorygenomeinsideallbacterialspeciesas wellaswillofferabroaderperspectiveofbacterialantibioticresistance.TheWHOsummarizesantimicrobialresistance(AMR)astheresistanceofamicroorganismtoanantimicrobialdrugthatwasoriginallyeffectiveforthetreatmentofinfectionscausedby themselves.Anadequateapproachtosolvingmajorquestionsabouttheresistomeinside ofthebacterialgenome [34] istoperformapan-genomicsanalysis.Theupdatedpangenomedatawillbeclosetothemetadataavailableforestablishingthepartofresistome traitsthatbelongbothtocore-genomeasaccessorygenomeinbacterialspecies;aswellas abroaderperspectiveofantibioticresistanceinbacteria.Theemergentantibioticresistantpathogenicbacteriaareacurrentmenacingconcern. Pseudomonasaeruginosa, Acinetobacterbaumannii,andcoliformbacteriaarethenewemergentantibiotic-resistantbacteriaaccordingtotheWHO.Pan-genomicshastackledsomeimportantconcerns,which wouldbeimpossibletosolveusingclassicalmolecularbiologyordescriptivegenomics:it isveryimportanttodefinethecoreandaccessorygenomeforestablishingtheplasticityof resistome.Thousandsofunknownbacteriaandmicroorganismsareexposedtomanufacturedantibiotics,leadingustoassumethattherearenomeanstopreventthiscatastrophe. Inopposition,pan-genomicsisapowerfulapproachtopreventsuchdisaster.Wemust movetowardsequencingofknownandunknownspecies,classifythem,andestablishing itsantibiotic-resistantstatus,theirpan-genome,andcomeoutwithnewalternativesfor reducingantibioticconsumptionnowadays.

2.4Applicationsofpan-genomicsinveterinarypathogens

FollowingthedevelopmentofNGS,thenumberofsequencedgenomesfiledexponentially [35].Thus,projectsaimedatstudyinggroupsoforganismsbecameviable,andthus, severalstudiesappearedthatarecalledOmicsstudies.Thestudiesinvolvingpan-genomes areexposingimportantinformationonthedifferencesandsimilaritybetweenorganisms ofthesameorbetweenspecies.Forconceptpurposes,wehavethePan-genomeasaset ofgenesinagivengroupofindividuals [10].Thisinformationisbeingexploredand appliedbyseveralscientificfronts,forexample,inbacteriathatinfectanimalsand humans.Themainapplicationsofthesestudiesareinthedevelopmentofprophylactic anddiagnosticmethodsinlesstimeandwithlesscost,moreprecisetaxonomicstudies, studiesongeneticvariations,andpathogenesis [17].Inthischapter,wedescribemore recentresearchinvolvingpan-genomicsofthepathogenicbacteriathatcauseveterinary diseases,includingsomeresponsibleforzoonoses,theyare: Corynebacteriumpseudotuberculosis; Corynebacteriumulcerans; Streptococcussuis; Brachyspirahyodysenteriae; Moraxellabovoculi; Pasteurellamultocida; Mannheimiahaemolytica; Clostridiumbotulinum; Campylobacter; Streptococcusagalactiae; Francisellatularensis; Corynebacteriumdiphtheriae; Brucella spp.Finally, itisworthhighlightingthattheinfluenceoftheapproacheswithbigdataandartificial intelligenceareincreasingandtheinfluencesoftheseinPan-genomicstudieswillbringa neweraofstudiesanddiscoveries.

2.5Applicationsofpan-genomicsinaquaticpathogenicbacteria

Thesustainabilityofaquacult ureindustryiscriticalbothforglobalfoodsecurityand economicwelfare.However, themassivewealthofpathogenicbacteriaposesakey challengetothedevelopmentofasustainablebiocontrolmethod.Recentadvances ingenomesequencingstudycombinedwithpan-genomeanalysiscanbeanefficacious managementappliedtonumerousaquaticpathogens [36].Thus,routinepangenome analysesofgenomic-derivedaquaticpathogenswilldeducethephylogenomicdiversity andpossibleevolutionarytrendsofaquaticbacterialpathogenstrains,elucidatethe mechanismsofpathogenesis,aswellasestim atepatternsofpathogentransmissionacross epidemiologicalscales.Thewholegenome sequencingdataisth eopportunitytorevolutionizethemolecularepidemiologyofaquaculturepathogensasithasforthose pathogensofrelevancetopublichealth [37].Challengesofaquaculturediseasemanagementarethebiologicaldiversityofpathogens,host-pathogeninteractions(e.g.,differentmodesofadaptationandtransmission ),andshiftingenvironmentalpressures,in particularclimatechange.Hence,analysisofpathogenicphenotypecombinedwith genotypederivedfromthefullpotentialof genomesequencingdataiscriticaltoreconstructpathogentransmissionroutesonlocalandglobalscales,aswellasmitigatedisease emergenceandspread.

Comparativepan-genomeanalysesareaneffectivetoolwhichcouldpossiblybe extendedtotheanalysisofaquaticmicroorganismsandtodynamiccharacteristicsand adaptationtoabroadrangeoftheirhostsandenvironmentalniches.Conspicuously, ourpreviouspan-genomeanalysis [38] showedthatstrainWFLU12isolatedfrommarine fishexhibitedniche-specificcharacteristicsofenergyproductionandconversion,and carbohydratetransportandmetabolismbyexploringgenesinthegenerepertoireof strains.Basedonthepan-genomecategories,thefunctionalannotationsofselectedgenes canbereanalyzedwiththeVirulenceFactorsDatabase(VFDB),ClustersofOrthologous Groups(COG),KyotoEncyclopediaofGenesandGenomes(KEGG),andAntibiotic ResistanceGenesDatabase(ARDB).Also,comparativepan-genomehasadvancedto thepointwhengenesarepredictedasbelongingtocellsurface-exposedproteins (SEPs)fromimportantpathogens,includingoutermembraneproteins,andextracellular proteins.Thesepredictedgenesareservingasvaccinecandidatesinananimalmodel calledReversedVaccinology(RV) [39].Inaquaculture,SEPsfrompathogensinclude severalimportantvirulencefactorsthatplaykeyrolesinbacterialpathogenesisandhost immuneresponses.Forexample,theexpressionof esa1 from Edwardsiellatarda,aD15likesurfaceantigen,intheJapanesefloundermodelinducedtheexpressionofabroad spectrumofgenespossiblyinvolvedinbothinnateandadaptiveimmunity,aswellas ahighleveloffishsurvivalandproducedspecificserumantibodies [40].Vaccination usingSEPsresultsinthedevelopmentofprotectiveeffectsagainst Aeromonashydrophila infection, Flavobacteriumcolumnare infection, Pseudomonasputida infection,andEdwardsiellosis[asinthereviewofAbdelgayed [41]].Arecentstudy [42] hassuccessfullyimplementedapan-genomeanalysistoscreenSEPsfrom17representative Leptospirainterrogans strainscoveringmultiepidemicserovarsfromaroundtheworld,and118newcandidate antigenswereidentifiedinadditiontoseveralknownoutermembraneproteinsandlipoproteins.Wehighlyconsiderthattherapidincreaseinthenumberofgenomesequencing ofaquaticpathogenswillallowustodeveloparapid-responseinfectioncontrolprotocols,butalsobeapotentialtrendforstudyingaquaticpathogenicbacteriatoimprovethe cross-serotypeefficacyofvaccinesinfarmedfishandstemthediseaseoutbreakwhen implementingpan-genomeanalysis(usingRVstrategy).Inthechapter“Pan-genomics ofaquaticanimalpathogensanditsapplications,”wereviewedcomparativepan-genome analysiswithaparticularfocusoncontrollingaquaticdiseasesandgivereal-worldexamplesbyanalyzinggenomesequencingdataderivedfromaquaticbacterialisolates.

2.6Pan-genomicsapplicationsfortherapeutics

Theemergenceofbacterialresistanceisoccurring,threateningtheabilityofantibiotics thathavetransformedmedicineandsavedmillionsoflivesaroundtheglobe [43,44].The occurrenceofbacterialresistancehasbeenidentifiedsincethebeginningoftheantibiotic erabuttheemergenceofmostdangerousandeasilycommunicatedstrainshasbeen

reportedinpasttwodecades [45,46].Afterseveralyearsofthefirstpatienttreatedwith antibiotics,bacterialinfectionsbecameathreatforsocietyonceagain.Thissituationis mainlybecauseofthemisuseand/oroveruseofantibioticsaswellastheinefficiency ofpharmaceuticalcompaniesfornotproducingadvanceddrugs,onceeconomicinvestmentshavebeenreduced [44].TheCentersforDiseaseControlandPrevention(CDC) hascategorizedseveralbacterialstrainsasanalarmingthreatthatneedseriousconsiderationforpropertreatmentandarealreadyresponsibleforputtingsignificantburdenon thehealth-caresystemintheUnitedStates(US),ultimately,affectingpatientsandtheir families [43,47,48].Theinfectionscausedbyantibiotic-resistantstrainsofbacteriaare pervasiveworldwide [43,44].Anationalsurveyofinfectious-diseasespecialistsledbythe IDSAEmergingInfectionsNetworkin2011foundthatabouttwo-third(2/3)ofthe participantshadseenapan-resistantanddeadlybacterialinfectionwithinthepastfew years [49].Therapidemergenceofresistantbacteriahasbeendescribedasanightmare byseveralpublichealthorganizationsthatcouldhavedisastrousresults [50].TheWHO cautionedin2014thatthedisasterofantibioticresistanceisbecomingdreadful [51] AmongGram-positivepathogens,auniversalendemicofresistant S.aureus and Enterococcus speciesarepresentlythebiggestintimidation [48].Vancomycin-resistantenterococci(VRE)andadditionalemergentpathogensareevolvingresistancetonumerous antibioticsusedcommonly [43].Theworldwidedistributionofcommonrespiratory pathogensincludes Streptococcuspneumoniae and Mycobacteriumtuberculosis,whichare reportedasepidemic [48].Gram-negativepathogensareingeneralmoretroublesome becauseofthefactthattheyarebecomingmoreresistanttoalmostalltheavailabletherapeutics,makingtheconditionsevocativetothepreantibioticera [44].Theoccurrence ofmultidrugresistant(MDR)Gram-negativebacillihasoutdatedallthepracticeinfield ofmedicine [43].ThemostcommoninfectionscausedbyGram-negativebacteriain health-caresettingsareusuallyby Enterobacteriaceae (mostly Klebsiellapneumoniae), Acinetobacter,and Pseudomonasaeruginosa [43,44].Theevolutionofbacterialstrainsanddevelopmentofantibiotic-resistantgenesthroughHGTmakeitnecessarytolookfornovel andadvancedstrategiestocopewiththeinfections [52]

Theinsilicoapproacheslikepan-genome,pan-modelome,subtractivegenomics,and reversevaccinologyareplayingvitalrolesinrapididentificationofnewtherapeutictargetsinthepostgenomicera [53–55].Comparativemicrobialgenomicsapproachalong withstatisticalanalysisareusefultoolsfortheidentificationofessentialgeneticcontents commonlypresentinallpathogenicisolates,basedonsequencesimilarity.Inadditionto essentialgeneticcontents,italsohelpstoidentifysubsetofgenesencodingvirulenceand novelfunctionsasthevariablegenome [56].Apan-genomeisusuallydividedintothree parts,thatis,coregenes,accessorygenes,andstrain-specificgenes.Inthedrugand vaccinediscoveryprocess,theveryfirststepisalwaystheidentificationofasuitable target.Subtractivegenomicsisawidelyusedprocessinthisregard.Inrecentpast,workingwithpathogenicbacteria,usingcomputationalapproaches,alargenumberofnovel

Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.