System on chip interfaces for low power design 1st edition mishra - Download the ebook in PDF with a

Page 1


https://ebookmass.com/product/system-on-chip-interfaces-forlow-power-design-1st-edition-mishra/

Instant digital products (PDF, ePub, MOBI) ready for you

Download now and discover formats that fit your needs...

Waste-to-Resource System Design for Low-Carbon Circular Economy Siming You

https://ebookmass.com/product/waste-to-resource-system-design-for-lowcarbon-circular-economy-siming-you/

ebookmass.com

Sustainable Wireless Network-On-chip Architectures 1st Edition Murray

https://ebookmass.com/product/sustainable-wireless-network-on-chiparchitectures-1st-edition-murray/

ebookmass.com

Low Power FM For Dummies Sharon Scott

https://ebookmass.com/product/low-power-fm-for-dummies-sharon-scott/

ebookmass.com

A Theory of Global Governance: Authority, Legitimacy, and Contestation Michael Zurn

https://ebookmass.com/product/a-theory-of-global-governance-authoritylegitimacy-and-contestation-michael-zurn/

ebookmass.com

Crécy: The Age of the Archer (Sir John Hawkwood Book 1) Griff Hosker [Hosker

https://ebookmass.com/product/crecy-the-age-of-the-archer-sir-johnhawkwood-book-1-griff-hosker-hosker/

ebookmass.com

The Girl in White Lindsay Currie

https://ebookmass.com/product/the-girl-in-white-lindsay-currie-3/

ebookmass.com

The Dance Tree Kiran Millwood Hargrave

https://ebookmass.com/product/the-dance-tree-kiran-millwood-hargrave/

ebookmass.com

Fundamentals of Trace and Log Analysis: A Pattern-Oriented Approach to Monitoring, Diagnostics, and Debugging 1st Edition Dmitry Vostokov

https://ebookmass.com/product/fundamentals-of-trace-and-log-analysisa-pattern-oriented-approach-to-monitoring-diagnostics-anddebugging-1st-edition-dmitry-vostokov-2/

ebookmass.com

The Heir of Magic and Moonlight (Twingenuity Series Book 2) Danielle Hill

https://ebookmass.com/product/the-heir-of-magic-and-moonlighttwingenuity-series-book-2-danielle-hill/

ebookmass.com

Henke’s Med-Math: Dosage Calculation, Preparation, and Administration 8th Edition – Ebook PDF Version

https://ebookmass.com/product/henkes-med-math-dosage-calculationpreparation-and-administration-8th-edition-ebook-pdf-version/

ebookmass.com

SystemonChipInterfacesfor LowPowerDesign

SystemonChipInterfacesfor LowPowerDesign

SanjeebMishra

NeerajKumarSingh

VijayakrishnanRousseau

AcquiringEditor: ToddGreen

EditorialProjectManager: LindsayLawrence

ProjectManager: PunithavathyGovindaradjane

Designer: AlanStudholme

MorganKaufmannisanimprintofElsevier 225WymanStreet,Waltham,MA02451,USA

Copyright # 2016SanjeebMishra,NeerajKumarSingh,andVijayakrishnanRousseau.PublishedbyElsevierInc. Allrightsreserved.

IntelownscopyrightforthematerialscreatedbytheAuthorsinthescopeoftheAuthor’semploymentatIntel.

Theviewsandopinionsexpressedinthisworkarethoseoftheauthorsanddonotnecessarilyrepresenttheviewsof IntelCorporation.

Nopartofthispublicationmaybereproducedortransmittedinanyformorbyanymeans,electronicormechanical, includingphotocopying,recording,oranyinformationstorageandretrievalsystem,withoutpermissioninwritingfromthe publisher.Detailsonhowtoseekpermission,furtherinformationaboutthePublisher’spermissionspoliciesandour arrangementswithorganizationssuchastheCopyrightClearanceCenterandtheCopyrightLicensingAgency,canbe foundatourwebsite: www.elsevier.com/permissions

ThisbookandtheindividualcontributionscontainedinitareprotectedundercopyrightbythePublisher(otherthan asmaybenotedherein).

Notices

Knowledgeandbestpracticeinthisfieldareconstantlychanging.Asnewresearchandexperiencebroadenour understanding,changesinresearchmethods,professionalpractices,ormedicaltreatmentmaybecomenecessary.

Practitionersandresearchersmustalwaysrelyontheirownexperienceandknowledgeinevaluatingandusingany information,methods,compounds,orexperimentsdescribedherein.Inusingsuchinformationormethodstheyshouldbe mindfuloftheirownsafetyandthesafetyofothers,includingpartiesforwhomtheyhaveaprofessionalresponsibility.

Tothefullestextentofthelaw,neitherthePublishernortheauthors,contributors,oreditors,assumeanyliability foranyinjuryand/ordamagetopersonsorpropertyasamatterofproductsliability,negligenceorotherwise,or fromanyuseoroperationofanymethods,products,instructions,orideascontainedinthematerialherein.

ISBN:978-0-12-801630-5

BritishLibraryCataloguinginPublicationData

AcataloguerecordforthisbookisavailablefromtheBritishLibrary

LibraryofCongressCataloging-in-PublicationData

AcatalogrecordforthisbookisavailablefromtheLibraryofCongress

ForinformationonallMKpublications visitourwebsiteat www.mkp.com

CopyrightPermissions

Designationsusedbycompaniestodistinguishtheirproductsareoftenclaimedastrademarksorregisteredtrademarks.InallinstancesinwhichMorganKaufmannPublishersisawareofaclaim,the productnamesappearininitialcapitalorallcapitalletters.Readers,however,shouldcontactthe appropriatecompaniesformorecompleteinformationregardingtrademarksandregistration.

CSI-2℠,D-PHY℠,andDSI℠ areservicemarks,andSLIMbus® isaregisteredtrademark,ofMIPI Alliance,Inc.intheUSandothercountries.MIPI,MIPIAlliance,andthedottedrainbowarchandall relatedtrademarksandtradenamesaretheexclusivepropertyofMIPIAlliance,Inc.,andcannotbe usedwithoutitsexpresspriorwrittenpermission.

FiguresbelowareCopyright # 2005-2015byMIPIAlliance,Inc.andusedbypermission.Allrights reserved.

Figures4.35,4.36,4.37,4.38,4.40,4.41,4.42,4.43,4.44,4.45,4.46,4.47,4.48,4.48,4.49,4.50, 4.51,4.52,4.53,4.54,4.55,5.12,5.13,5.14,5.17,5.18,5.19,5.20,5.21,5.55,5.57,5.58,5.59, 5.60,5.61,5.62,and5.63

FigurebelowisreprintedwithpermissiongrantedbyANSIonbehalfofINCITStousematerialfrom INCITS452-2009[R2014].Allcopyrightsremaininfulleffect.Allrightsreserved.

Figure7.6

FiguresbelowarereprintedwithpermissionfromtheVideoElectronicsStandardsAssociation, CopyrightVESA, www.VESA.org.Allrightsreserved.

Tables4.1,4.2,4.3,4.4,4.5and4.6

Figures4.13,4.14,4.15,4.17,4.18,4.19,4.23,4.26,4.27,4.28,4.29,4.30,4.31,and4.32

FiguresbelowarereprintedwithpermissionfromHigh-DefinitionMultimediaInterfaceVersion 1.3a,Copyright,HDMI1.3a.Allrightsreserved.

Figures4.9,4.10,4.11,and4.12. Tables4.11and4.12

FiguresbelowarereprintedwithpermissionfromPCI-SIG.Allrightsreserved.

Figures5.42,5.44,5.46,5.47,5.52,and5.54

FiguresbelowarereprintedwithpermissionfromUSB3.0.CopyrightIntelCorporation.Allrights reserved.

Figures5.26,5.27,C.1,C.2,C.3,C.4,andC.5.

FigurebelowisreprintedwithpermissionfromIntel® LPCinterfacespecificationv1.1,copyright # 2002,IntelCorporation.

Figure8.3

Dedicatedtomylatebrother,Sandeep, whopassedawayatsuchayoungage.

—NeerajKumarSingh

Acknowledgments

Wewouldliketoexpressgratitudetothepeoplewhohelpedusthroughthisbook;someofthem directlyandmanyothersindirectly.It’simpossibletonotriskmissingsomeone,butwewill attemptanyway.

Firstandforemost,wewouldliketoacknowledgeBalamuraliGouthamanforwritingthesensor,security,andinput/outputinterfacechapters;andKiranMathforhishelponthestorage sectionofthebook.

WewouldliketothankStuartDouglasandDavidClarkfortheirhelpinreviewingtheconcept, structure,andcontentofthebookandarrangingforpublishingwithElsevier—David,your meticulousreviewshelpedthebooksignificantly.

ThankyousomuchToddGreen,LindsayLawrence,PunithaGovindaradjaneandallthe Elsevierpublishingteamfortheoutstandingwork,help,guidance,andsupport;youhavegone theextramiletomakethebookwhatitis.

WewouldliketothankIntelmanagement,inparticularPramodMaliandSiddanagoudaS.,for thesupportandencouragement.

Aboveall,wethankourfamilyandfriendsfortheirunderstanding,support,andforbeingcontinuoussourcesofencouragement.

Chapter 1

SoCDesignFundamentalsandEvolution

Thischapterdiscussesvarioussystemdesignintegrationmethodologies alongwiththeiradvantagesanddisadvantages.Thechapteralsoexplains themotivationforcurrentsystemdesignstomovefrom“systemonboard” designstoward“systemonchip”(SoC)designs.IndiscussingthemotivationforthemovetowardSoCdesign,thechapteralsodiscussesthetypical chipdesignflowtradeoffsaswellashowtheyinfluencethedesignchoices.

INTRODUCTION

Asystemissomethingthatachievesameaningfulpurpose.Likeeverything else,itdependsonthecontext.Acomputersystemwillhavehardwarecomponents(theactualmachinery)andsoftwarecomponents,whichactually drivethehardwaretoachievethepurpose.Forexample,talkingaboutapersonalcomputer(alsocommonlyknownasaPC),alltheelectronicsarehardware,andtheoperatingsystemplusadditionalapplicationsthatyouuseare software.

However,inthecontextofthisbook,bya system wemeanthehardwarepart ofthesystemalone. Figure1.1 showsaroughblockdiagramofasystem. Thesysteminthediagramconsistsofaprocessingunitalongwiththeinput/ outputdevices,memory,andstoragecomponents.

Typicalsystemcomponents

Roughlyspeaking,atypicalsystemwouldhaveaprocessortodothereal processing,amemorycomponenttostorethedataandcode,somekind ofinputsystemtoreceiveinput,andaunitforoutput.Inaddition,weshould haveaninterconnectionnetworktoconnectthevariouscomponents togethersothattheyworkinacoherentmanner.Itshouldbenotedthat basedontheusagemodelandapplicabilityofthesystem,thevariouscomponentsinthesystemmaycomeindifferingformats.Forexample,inaPC environment,keyboard,andmousemayformtheinputsubsystem,whereas inatabletsystemtheymaybereplacedbyatouchscreen,andinadigital

healthmonitoringsystemtheinputsystemmaybeformedbyagroupofsensors.Inadditiontothebareessentials,theremaybeothersubsystemslike imaging,audio,andcommunication.In Chapter3 we’lltalkaboutvarious subsystemsingeneral,involving:

1. Processor

2. Memory

3. Inputandoutput

4. Interconnects

5. Domain-specificsubsystems(camera,audio,communication,and soon)

Categorizationofcomputersystems

Computersystemsarebroadlycategorizedas:general-purposecomputer systemslikepersonalcomputers,embeddedsystemsliketemperaturecontrolsystems,andreal-timesystemslikebrakingcontrolsystems.Generalpurposecomputingsystemsaredesignedtobemoreflexiblesothatthey canbeusedfordifferenttypesoffunctions,whereasembeddedsystems aredesignedtoaddressaspecificfunctionandarenotmeanttobegeneric. Theyareusuallyembeddedaspartofalargerdevice,andtheuserseldom directlyinteractswithsuchasystem.Real-timesystemsareembedded systemswithstringentresponsetimerequirements.Allthesecomputing systemsarebuiltusingthesamebasicbuildingblocksasshownin Figure1.1.Theflavorofthebuildingblocksmayvaryfromsystemtosystembecausethedesignparametersanddesignrequirementsaredifferent. Forexample,sinceembeddedsystemshaveafixedknownusage,thecomponentscanoptimallybechosentomeetthatfunctionalrequirement.The

Main memory
Secondary memory/storage
Input devices
Processor
Output devices
n FIGURE1.1 Asystemwithmemory,processor,input/output,andinterconnects.

general-purposesystem,ontheotherhand,mighthavetosupportarangeof functionalityandworkloads,andthereforecomponentsneedtobechosen keepinginmindthecostanduserexperiencefortherangeofapplications. Similarlythecomponentsforreal-timesystemsneedtobechosensuchthat theycanmeettheresponsetimerequirement.

SYSTEMAPPROACHTODESIGN

Duetothetighterbudgetoncost,power,andperformancediscussedinthe previoussection,thewholesystemisbeingthoughtaboutanddesignedas completeandnotasanassemblyofdiscretepieces.Thephilosophyofsystemdesigntherebybringstheopportunitytooptimizethesystemforaparticularusage.Thereisnorealchangeinthesystemfunctionality;it’sjust thatitisadifferentwayofthinkingaboutthesystemdesign.Wealready talkedaboutthetypicalsystemcomponents;nextwewilldiscussthehardwaresoftwareco-design,followedbyvarioussystemdesignmethodologies.

Hardwaresoftwareco-design

Asdiscussedearlier,asystemingeneralhassomehardwareaccompanied bysomesoftwaretoachievethepurpose.Generally,thesystem’sfunctionalityasawholeisspecifiedbythebaredefinitionofthesystem.However, whatpartofthesystemshouldbededicatedhardwareandwhatshouldbe softwareisadecisionmadebythesystemarchitectanddesigner.Theprocessoflookingatthesystemasawholeandmakingdecisionsastowhat becomeshardwareandwhatbecomesasoftwarecomponentiscalled hardwaresoftwareco-design.Typicallytherearethreefactorsthatinfluencethe decision:

n Input,output,memory,andinterconnectsneedtohavehardware (electronics)todothefundamentalpartexpectedfromthem.However, eachoftheseblockstypicallyrequiressomeprocessing;forexample, touchdatareceivedfromtheinputblockneedstobeprocessedtodetect gestures,ortheoutputdataneedstobeformattedspecificallytosuitthe display.Theseprocessingparts,generallyspeaking,arepartofthe debateastowhetheradedicatedhardwarepieceshoulddothe processingorwhetherthegeneral-purposeprocessorshouldbeableto takecareoftheprocessinginpartorfull.

n Thesecondfactorthatcontributestothedecisionistheexperience thatwewanttodelivertotheuser.Whatthismeansisthat,dependingon theamountofdatathatneedstobeprocessed,thequalityofthe outputthatisexpected,theresponsetimetotheinput,andsoon,wehave todecidethequalityofthededicatedhardwarethatshouldbeused,

andalsothishelpsmakethedecisionastowhichprocessingshouldbe donebydedicatedhardwareandwhichbysoftwarerunningontheCPU. Theassumptionhereisthathardwarededicatedtodoingspecific processingwillbefasterandmoreefficient,sowhereverweneedfaster processing,wededicatehardwaretodotheprocessing,suchas,for example,graphicsprocessingbeingprocessedbyagraphics processingunit.

n Thethirdfactoristheoptimality.Therearecertaintypesofprocessing thattakealotmoretimeandenergywhendonebygeneral-purpose processingunitsasopposedtoaspecializedcustomprocessor,suchas digitalsignalprocessingandfloatingpointcomputations,whichhave dedicatedhardware(DSPunitandfloatingpointunit,respectively) becausetheyareoptimallydoneinhardware.

Systemdesignmethodologies

Earlyon,thescaleofintegrationwaslow,andthereforetocreateasystemit wasnecessarytoputmultiplechips,orintegratedcircuits(ICs),together. Today,withvery-large-scaleintegration(VLSI),designingasystemona singlechipispossible.So,justlikeanyotherstream,systemdesignhas evolvedbasedonthetechnologicalpossibilitiesofthegeneration.Despite thefactthatsystemonasinglechipispossible,however,thereisnoone designthatfitsall.Incertaincasesthedesignissocomplexthatitmay notfitonasinglechip.Why?Basedonthetransistorsize(whichislimited bytheprocesstechnology)andsizeofthedie(againlimitedbytheprocess technology)thereisalimitednumberoftransistorsthatcanbeplacedona chip.Ifthefunctionalityiscomplexandcannotbeimplementedinthatlimitednumberoftransistors,thedesignhastobebrokenoutintomultiple chips.Also,thereareotherscalabilityandmodularityreasonsfornot designingthewholesysteminonesinglechip.Inthefollowingsectionwe’ll discussthethreemajorsystemdesignapproaches:systemonboard(SoB), systemonchip(SoC),andsysteminapackage(SiP)oronapackage(SoP).

Systemonboard

SoBstandsforsystemonboard.Thisistheearliestevolutionofsystem design.Backinthe1970sand1980swhenasinglechipcoulddoonlyso much,thesystemwasdividedintomultiplechipsandallthesechipswere connectedviaexternalinterconnectinterfacesoveraprintedcircuitboard. SoBdesignsarestillapplicabletodayforlargesystemdesignsandsystem designsinwhichdisparatecomponentsneedtobeputtogethertoworkasa system.

AdvantagesofSoB

Despitethefactthatthisistheearliestapproachtosystemdesignandbackin theearlydaysitwastheonlyapproachfeasibletobeabletodoanything meaningful,theSoBdesignapproachisprevalenteventodayandhasa lotofadvantagesoverotherdesignapproaches:

n Itisquickandeasytododesignspaceexplorationwithdifferent components.

n Proven(prevalidatedandused)componentscanbeputtogethereasily.

n Designcomplexityforindividualchipsisdivided,sotheriskofabug isless.

n Thedebuggingofissuesbetweentwocomponentsiseasierbecausethe externalinterfacescanbeprobedeasily.

n Individualcomponentscanbedesigned,manufactured,anddebugged separately.

DisadvantagesofSoB

SincethereisamovetowardSiP/SoPandSoC,theremustbesomedisadvantagestotheclassicalSoBdesignapproach;thesecanbesummarizedas follows:

n Becauseoflongconnectivity/interconnects,thesystemconsumesmore powerandprovideslessperformancewhencomparedtoSoC/SiP/SoP designs.

n Overallsystemcostisgreaterbecauseoflargersize,morematerials requiredinmanufacturing,higherintegrationcost,andsoon.

n Sinceindividualcomponentsaremadeandvalidatedseparately,they cannotbecustomizedoroptimizedtoaparticularsystemrequirementor design.

Systemonchip

BydefinitionSoCmeansacompletesystemonasinglechipwithnoauxiliarycomponentsoutsideit.ThecurrenttrendtodayisthatallthesemiconductorcompaniesaremovingtowardSoCdesignsbyintegratingmoreand morecomponentsofasystemasSoC.However,thereisnotasingleexampleofapureSoCdesign.

AdvantagesofSoC

SomeoftheadvantagesofSoCdesignare

n lowersystemcost, n compactsystemsize, n reducedsystempowerconsumption,

n increasedsystemperformance,and n intellectualpropertyblocks(IPs)usedinthedesigncanbecustomized andoptimized.

DisadvantagesofSoC

EventhoughitlooksasthoughSoCdesignisveryappealing,therearelimitations,challenges,andreasonsthatnoteverythinghasmovedtoSoC. Someofthereasonsareoutlinedbelow:

n Forbigdesigns,fittingthewholelogiconasinglechipmaynotbe possible.

n Siliconmanufacturingyieldmaynotbeasgoodbecauseofthebigdie sizerequired.

n TherecanbeIPlibrary/resourceproviderandlegalissues.

n Chipintegration:Componentsdesignedwithdifferentmanufacturer processesneedtobeintegratedandmanufacturedononeprocess technology.

n Chipdesignverificationisachallengebecauseofthehugemonolithic design.

n Chipvalidationisachallenge,alsobecauseofthemonolithicdesign.

Systeminapackage

SiPorSoPdesignisapracticalalternativetocounterthechallengesposed bytheSoCapproach.Inthisapproach,variouschipsaremanufacturedseparately;however,theyarepackagedinsuchawaythattheyareplacedvery closely.Thisisalsocalleda multichipmodule (MCM)or multichippackage (MCP).ThisisakindofmiddlegroundbetweenSoBandSoCdesign methodologies.

AdvantagesofSiP

Inthisapproachthechipsareplacedcloseenoughtogivecompactsize, reducedsystempowerconsumption,andincreasedsystemperformance. Inaddition:

n IPsbasedondifferentmanufacturingtechnologiescanbemanufactured ontheirowntechnologiesandpackagedasasystem.

n Becauseofsmallersizesofindividualchips,themanufacturingyield isbetter.

n Developmentcomplexityislessbecauseofdivisionofdesigninto multipleparts.

n Bigdesignsthatcannotbemanufacturedasasinglechipcanbemadeas aSiP/SoP.

DisadvantagesofSiP

Despitethefactthatthedifferentchipsareplacedverycloselytominimize thetransmissionlatency,theSiPdesignislessthanoptimalintermsof powerandperformanceefficiencywhencomparedtoSoCdesigns.Inaddition,thepackagingtechnologyfortheMCM/MCPsystemismorecomplex andmorecostly.

Inmostoftheliterature, SiP and SoP areusedinterchangeably;however, sometimestheyhavedifferentmeanings.SiPreferstoverticalstackingof multiplechipsinapackageandSoPreferstoplanarplacementofmorethan onechipinapackage.Forexample,aSiPorSoPcancontainmultiplecomponentslikeprocessor,mainmemory,flashmemoryalongwiththeinterconnects,andauxiliarycomponentslikeresistor/capacitoronthesame substratetomakeitaSiPorSoP.

Application-specificintegratedcircuit

Application-specificintegratedcircuit(ASIC)isafunctionalblockthatdoes aspecificjobandisnotsupposedtobeageneral-purposeprocessingunit. ASICdesignsarecustomizedforaspecificpurposeorfunctionalityand thereforeyieldsmuchbetterperformancewhencomparedtogeneralpurposeprocessingunits.ASICdesignsarenotacompetingdesignmethodologytoSoC,butrathercomplementary.So,whendesigninganSoC, thedesignermakesadecisionastowhatIPsorfunctionalblockstointegrate.AndthatdecisioncomesbasedonwhethertheSoCismeanttobe generalpurpose,cateringtovariousdifferentapplicationneeds(likeatablet SoCthatcanbeusedwithdifferentoperatingsystemsandthencustomized toserveasrouterordigitalTVorGPSsystem),orforaspecificpurposethat issupposedtocatertoonlyaspecificapplication(e.g.,aGPSnavigator).

AdvantagesofASIC

So,onemightthinkthatitisalwaysbettertomakeageneral-purposeSoC, whichcancatertomorethanjustoneapplication.However,therearesignificantreasonstochoosetomakeanASICoverageneral-purposeSoC:

n Cost:Whenwemakeageneral-purposeSoCanditiscustomizedfora specificpurpose,agoodpieceoflogiciswastedbecauseitisnotusedfor thatspecificapplication.IncaseofASIC,thesystemorSoCismadeto suit;thereisnoredundantfunctionality.Andthereforethedieareaofthe systemissmaller.

n Validation:ValidationofanASICismucheasierthanthegeneralpurposeSoC.Why?Becausewhenavendorcreatesageneral-purpose SoCandmarketsitassuch,thereareaninfinitenumberofpossibilities

forwhichthatSoCcanbeused,andthereforethevendorneedsto validatetoitsspecificationtoperfection.Ontheotherhand,whenone createsanASIC,thatpieceissupposedtobeusedforthatspecific purpose.Therefore,thevendorcanlivewithvalidationoftheASICfor thattargetedapplication.

n Optimization:Sinceit’sknownthattheASICwillbeusedforthe specificapplication,thedesignchoicescanbemademoreintelligently andoptimally;forexample,howmuchmemory,howmuchshouldbe memorythroughput,howmuchofprocessingpowerisneeded,and soon.

DisadvantagesofASIC

Therearealwaystradeoffs.Ofcourse,therearesomedisadvantagestothe ASICdesignapproach:

n Weallknowthatthehardwaredesignandthemanufacturingcycleare longandintensive(effortandcost).So,makinganASICforevery possibleapplicationisnotgoingtobecosteffective,unlesswecan guaranteethatthevolumeofeachsuchASICwillbehuge.

n Customerswantonesystemtobeabletodomultiplethings,ratherthan carryingonedeviceforGPS,oneforphonecalls,anotherforInternet browsing,anotheroneforentertainment(mediaplayback),andyet anotheroneforimaging.Also,sincetherearecommonfunctionblocks ineachofthesesystems,itismuchcheapertomakeonesystemtodoit all,whencomparedwithamortizedcostofallthedifferentsystems,each dedicatedforonefunctionality.

Systemonprogrammablechip

Becauseofaneedforfastdesignspaceexploration,anewtrendisfastgaininginpopularity:thesystemonaprogrammablechip,orSoPC.InanSoPC solutionthereisanembeddedprocessorwithon-chipperipheralsandmemoryalongwithlotsofgatesinafield-programmablegatearray(FPGA).The FPGAcanbeprogrammedwiththedesignlogictoemulate,andthesystem behavior,orfunctionality,canbeverified.

AdvantageofSoPC

SoPCdesignsarereconfigurableandthereforecanbeusedforprototyping andvalidatingthesystem.BugfixesaremucheasiertomakeinthisenvironmentthaninanSoCdesign,whereinoneneedstochurninanotherversionofsilicontofixandverifyabug,whichhasasignificantcost.

DisadvantageofSoPC

TheSoPCdesignmodelsthefunctionalityinanFPGA,whichisnotasfast asrealsiliconwouldbe.Itisthereforebestfitforthesystemprototypingand validation,andnotreallyforthefinalproduct.

Systemdesigntrends

Asweseefromtheprecedingdiscussion,therearemanyapproachestoa systemdesign:onemoresuitableforonescenariothanother.Itshould,however,benotedthattheSoCapproach,whereverpossible,bringsmany advantagestothedesign.Andtherefore,notsurprisingly,theSoCapproach isthetrend.However,forvariousreasonsapureSoCinidealtermsisnot possibleforarealsystem.Infact,initiallyitwasonlypossibletodesign smallerembeddeddevicesasSoCduetothelimitednumberoftransistors onachip.Itisnowpossibletointegrateevenageneral-purposecomputing deviceontoasinglechipbecauseMoore’slawhasallowedmoretransistors onasinglechip.SoCsforgeneral-purposecomputingdevicesliketablets, netbooks,ultrabooks,andsmartphonesarepossiblethesedays.Giventhe advantagesofSoCdesign,thelevelofintegrationinachipisgoingtodecide thefateofonecorporationversusanother.

HARDWAREICDESIGNFUNDAMENTALS

Intheprevioussectionwetalkedaboutvarioussystemdesignapproaches andtheconceptofhardwaresoftwareco-design.Irrespectiveofthesystem designmethodology,thecomputersystemismadeofICs.Weallknowthat theintegratedchipdesignisacomplexpipelineofprocessculminatinginan ICchipthatcomesoutofmanufacturing.Inthissectionwetalkalittlebit aboutthepipelineofprocessesinanICdesign.

ThebasicbuildingblockofanyICisatransistor,andmultipletransistorsare putandconnectedtogetherinaspecificwaytoimplementthebehaviorthat wewantfromthesystem.Sincetheadventoftransistorsjustafewdecades back,thesizeoftransistorshasgonedownexponentially,andthereforethe numberoftransistorsintegratedinachiphasgrownsimilarly.Justtobring insomeperspective,thenumberoftransistorsonachipin1966wasabout 10ascomparedtobillionsoftransistorsonthelatestonein2014.

Theminimumwidthofthetransistorisdefinedbythemanufacturingprocesstechnology.Foracademicpurposes,thelevelofintegrationhasbeen classifiedbasedonitsevolution:

1. SSI ¼ small-scaleintegration(upto10gates)

2. MSI ¼ medium-scaleintegration(upto1000gates)

3. LSI ¼ large-scaleintegration(upto10,000gates)

4. VLSI ¼ very-large-scaleintegration(over10,000gates)

Giventhecomplexityofthedesignstoday,theICdesignfollowsavery detailedandestablishedprocessfromspecificationtomanufacturingthe IC. Figure1.2 illustratestheprocess.

CHIPDESIGNTRADEOFF

Tradeoffisthewaytolife.Tradeoffbetweencostandperformanceisfundamentaltoanysystemdesign.Costofthesiliconisadirectfunctionofthe areaofthediebeingused,discountingtheotherone-timeexpensesin designingtheIC.Butchangingsetoftheusagemodelandexpectationfrom Physical verification System specification Architectural design

n FIGURE1.2 High-levelflowofchipdesign.

thecomputersystemhasbroughtintwoothermajordesigntradeoffs:thatof powerandthatofconfigurabilityandmodularity.

Poweruntilafewyearsagowasaconcernonlyformobiledevices.Itisand wasimportantformobiledevicesbecause,withthesmallbatterysizes requiredduetoportabilityandothersimilarreasons,itisimperativethat thepowerconsumptionforthefunctionalityisoptimal.However,asthe hardwaredesignsgotmorecomplexandnumberofsystemsinuseinenterprisesgrewexponentiallytohandletheexponentialgrowthintheworkload, enterprisesrealizedthattheelectricitybill(that’stherunningcostofthe computersystems)wasequally(ormaybemore)importantthanone-time systemcost.Sothechipvendorsstartedtoquotepowerefficiencyinterms ofperformanceperwatt.Andthebuyerswillpaypremiumforpowerefficientchips.

Theotherparametersconfigurabilityandmodularityaregainingorrather havegainedimportancebecauseoftheincessantpursuittoshortentime tomarket(TTM).Theamountoftimeittakestodesign(andvalidate)a functionalblockinthechipfrombaseisquitesignificant.However,if welookatthemarket,newproducts(orsystems)arelaunchedrather quickly.So,thereisaneedforthechipvendortodesignabaseproduct andbeabletoconfigurethesameproducttocatertovariousdifferentmarket segmentswithvaryingconstraints.Theotherfactorthatisbecoming(and againinfacthasbecome)importantismodularityofthedesign.Whyis modularityimportant?ThereasonagainisthattheTTMfromconception tolaunchofaproductissmall,andthedesignofthefunctionalblocksof systemisreallycomplexandtimeconsuming.So,thesystemdevelopment companiesaretakinganapproachtousethefunctionalblocksfromother designers(vendors)asIPandintegratethatintheirproduct.ThefundamentalrequirementforsuchastitchingtogetheristhatthesystemdesignandIP designbeingsourcedmustbothbemodularsotheycanworkwitheachother seamlessly.TheapproachhelpsboththesystemdesignerandIPdesigner: thesystemdesignerbyreducingtheirTTMandtheIPdesignerbyallowing themtospecializeandselltheirIPtoasmanysystemsvendorsaspossible.

Chapter 2 UnderstandingPowerConsumption Fundamentals

Thischapterstartsbyexplainingwhypoweroptimizationisimportant,then triestohelpthereaderunderstandthesourcesofpowerconsumption, measurementorhowtomonitorpowerconsumption,anddiscussesthestrategiesappliedtoreducepowerconsumptionattheindividualICandsystem level.However,beforewestarttodelveintothedetails,afewthingsabout whyit’simportant.

WHYPOWEROPTIMIZATIONISIMPORTANT

Savingenergyisbeneficialfortheenvironmentandalsofortheuser.There isalotofliteraturethatdiscussesthebenefitsindetail,buttogivejustafew obviousexamples,thebenefitsincludelowerelectricbillsforconsumers, longeruptimeofthedeviceswhenrunningonbatterypower,andsleeker mobilesystemdesignmadepossiblebysmallerbatteriesduetoenergy efficiency.

Knowingthatpowerconservationisimportant,nextweshoulddiscussand understandthefundamentalsofpowerconsumption,itscauses,andtypes. Onceweunderstandthem,wecanbetterinvestigatewaystoconservepower.

Sincetheuseofelectronicdevicesisprevalentacrosseveryaspectofour lives,reducingpowerconsumptionmuststartatthesemiconductorlevel. Thepower-savingtechniquesthataredesignedinatthechiplevelhavea far-reachingimpact.

Inthefollowingsectionwewillcategorizepowerconsumptionintwoways: powerconsumptionattheIClevelandpowerconsumptionatthesystemlevel.

PowerconsumptioninIC

Digitallogicismadeupofflip-flopsandlogicgates,whichinturnaremade upoftransistors.Thecurrentdrawnbythesetransistorsresultsinthepower beingconsumed.

n FIGURE2.1 Diagramofatransistordepictingvoltageandcurrentflow.

Figure2.1 showsatransistor,voltage,andthecurrentcomponents involvedwhilethetransistorisfunctioning.Sofromthediagram,the energyrequiredforthetransitionstatewillbe CL *Vdd 2 .Andthepower (energy*frequency)consumptioncanbeexpressedas CL *Vdd 2 *f.Goingfurther,thepowerconsumedbythedigitallogichastwomajorcomponents: staticanddynamic.

Staticpower

Staticpoweristhepartofpowerconsumptionthatisindependentofactivity. Itconstitutesleakagepowerandstandbypower.Leakagepoweristhepower consumedbythetransistorinoffstateduetoreversebiascurrent.Theother partofstaticpower,standbypower,isduetotheconstantcurrentfrom Vdd to ground.Inthefollowingsectionwediscussleakagepoweranddynamic power.

Leakagepower

Whenthetransistorsareintheoffstatetheyareideallynotsupposedtodraw anycurrent.Thisisactuallynotthecase:Thereissomeamountofcurrent drawnevenintheoffstateduetoreversebiascurrentinthesourceanddrain diffusions,aswellasthesubthresholdcurrentduetotheinversioncharge thatexistsatgatevoltagesunderthresholdvoltage.Allofthisiscollectively referredtoas leakagecurrent.Thiscurrentisverysmallforasingletransistor;however,withinanICtherearemillionstobillionsoftransistors,so thiscurrentbecomessignificantattheIClevel.Thepowerdissipateddueto thiscurrentiscalledleakagepower.Itisduetoleakagecurrentanddepends primarilyonthemanufacturingprocessandtechnologywithwhichthe transistorsaremade.Itdoesnotdependonthefrequencyofoperationof theflip-flops.

Standbypower

Standbypowerconsumptionisduetostandbycurrent,whichisDCcurrent drawncontinuouslyfrompositivesupplyvoltage(Vdd)toground.

Dynamicpower

Dynamicpower,duetodynamiccurrent,dependsonthefrequencythetransistorisoperatingon.Dynamicpowerisalsothedominantpartoftotal powerconsumption.Dynamiccurrentagainhastwocontributors:

1. Shortcircuitcurrent,whichisduetotheDCpathbetweenthesupplies duringoutputtransition.

2. Thecapacitancecurrent,whichflowstocharge/dischargecapacitive loadsduringlogicchanges.

Thedominantsourceofpowerdissipationincomplementarymetal-oxide semiconductor(CMOS)circuitsisthecharginganddischarging.Therate atwhichthecapacitiveloadischargedanddischargedduring“logiclevel transitions”determinesthedynamicpower.Asperthefollowingequation, withtheincreaseinfrequencyofoperationthedynamicpowerincreases, unliketheleakagepower.

ForCMOSlogicgate,thedynamicpowerconsumptioncanbeexpressedas:

where CL istheloadcapacitance(thecapacitanceduetoload), C istheinternalcapacitanceoftheIC, f isthefrequencyofoperation,and N isthenumber ofbitsthatareswitching.

So,fundamentally,asperformanceincreases(meaningthespeedandfrequencyoftheICincreases)theamountofdynamicpoweralsoincreases. Itcanalsobenotedthatdynamicpowerisdatadependentandiscloselytied tothenumberoftransistorsthatchangestates.

Asisevidentfromtheequationforpowerconsumption,attheIClevel,we haveafewfactorstotweakin:voltage,frequency,capacitance,andthe numberoftransitions,tocontrolorreducethepowerconsumption.

PoweroptimizationinIC

So,thetaskofpowerminimizationormanagementcanbedefinedas:minimizingpowerconsumptioninallmodesofoperation(bothdynamicwhen activeandstaticwhenidle/standby)withoutcompromisingontheperformancewhenneeded.Asdiscussedpreviouslythereareafewfactorsthat weneedtotweakintooptimize/minimizethepowerconsumption:

n Voltage.Asisevidentfromtheequation,loweringthesupplyvoltage quicklybringsdownthetotalpowerconsumption.So,whydon’twe bringthevoltagedownbeyondapoint?It’sbecausewepayaspeed penaltyforsupplyvoltagereduction,withdelaysdrastically increasingas Vdd approachesthethresholdvoltage(Vt)ofthedevices. Thistendstolimittheusefulrangeof Vdd toaminimumoftwoto threetimes Vt.Thelimitofhowlowthe Vt cangoissetbythe requirementtosetadequatenoisemarginsandcontroltheincreasein subthresholdleakagecurrents.Theoptimum Vt mustbedetermined basedonthecurrentgainoftheCMOSgatesatlowsupplyvoltage regimeandcontroloftheleakagecurrents.

n Capacitance.Letusnowconsiderhowtoreducephysicalcapacitance. Werecognizethatcapacitancescanbekeptataminimumbyusingless logic,smallerdevices,andfewerandshorterwires.Someofthe techniquesforreducingtheactiveareaincluderesourcesharing,logic minimization,andgatesizing.Aswithvoltage,however,wearenot freetooptimizecapacitanceindependently.Forexample,reducing devicesizesreducesphysicalcapacitance,butitalsoreducesthe currentdriveofthetransistors,makingthecircuitoperatemore slowly.Thislossinperformancemightpreventusfromlowering Vdd asmuchaswemightotherwisebeabletodo.

n Switchingactivity.Ifthereisnoswitchinginacircuit,thennodynamic powerwillbeconsumed.However,thiscouldalsomeanthatno computationoccurs.Sincetheswitchingactivityissodependenton theinputpattern,forageneral-purposeprocessor,containing switchingactivitymaynotberealistic.So,wedonotfocusonthis.It shoulddefinitely,however,bekeptinmindwhiledefiningand designingprotocolssothatweminimizetheswitchingactivitytoas littleaspossibleforaveragecasescenarios.

Applyingthefundamentalsdiscussedintheprevioussection,toaddressthe challengeofreducingpower,thesemiconductorindustryhasadoptedamultifacetedapproach,attackingtheproblemonthreefronts:

n Reducingcapacitance.Thiscanbeachievedthroughprocess developmentsuchassilicononinsulatorwithpartiallyorfully depletedwells,CMOSscalingtosubmicrondevicesizes,and advancedinterconnectsubstratessuchasmultichipmodules.Since thisisdependentontheprocesstechnologybeingused,thereisa limitationdependingonthecurrentprocesstechnology,andany advancementhasitsownpaceofdevelopment.

n Scalingthesupplyvoltagecapacitance.Thisisagainaprocessdependentfactor,anditalsorequireschangeintheauxiliarycircuit

andcomponentsinuse.However,moreimportantly,thesignal-to-noise ratioshouldbepropersothatthecommunicationisnotbrokenbecause ofnoisesignalsofcomparablestrength.

n Usingpowermanagementstrategies.Thisisoneareawherethe hardwaredesignercanmakeahugedifferencebyeffectively managingthestaticanddynamicpowerconsumption.Itshould howeverbenotedthattheactualsavingsdependalotontheusage scenarioortheapplicationofthesystem.Someexamplesofpower managementtechniquesaredynamicvoltageandfrequencyscaling (DVFS),clockgating,andsoforth,whichwillbediscussedinalittle detailinsubsequentsections.

Inthenextsectionwediscussthesestrategiesinsomedepth.Thevarious parametersinterferewitheachother,andthereforetheycannotbechosen independentofotherparametersorvectors.Forexample,CMOSdevice scaling,supplyvoltagescaling,andchoiceofcircuitarchitecturemustbe donecarefullytogetherinordertofindanoptimumpowerandperformance. Itmayseemthatprocessscalingcanhelpsolveallthepowerconsumption problems;however,weshouldnotethatleakagepoweroftheprocessof smallersizeismorethanleakagepoweroftheprocessofhighersize.Especiallywith45nmandbelow,theleakagepowerismorebecauseof increasedelectricfield.Tocounterthisproblem,newmaterialswerediscoveredandemployed.Silicondioxidehasbeenusedasagateoxidematerial fordecades.Thetablein Figure2.2 comparesvariousparametersofsilicon onthreedifferentprocesstechnologies.Inthetable90nmistakenasreferenceorbaseline.Pleasenotethatthetablecomparestheparametersin termsofmultiplierandthevaluesarenotabsolute.Itshouldalsobenoted thatthevaluesarearoughestimatebecausetheseparameterswillbeinfluencedbyotherfactorsaswell.Thekeytakeawayorthepointtonotefrom

Parameter/technology 90 nm65 nm45 nm

Node length (nm)1x0.7x0.5x

Frequency (GHz)1x1.43x2x

power (W)1x0.7x0.5x

Dynamic power density (W/cm2) 1x 1.4x2x

Leakage power density (W/cm2) 1x 2.5x6.5x

Power density(W/cm2) 1x 2x 4x

n FIGURE2.2 Comparisonofvariousparametersdrivenbyprocessin90,65,and45nm.

thetableisthattheleakagepowerisgrowingfasterthandevicelength goingdown.

So,anumberofpower-savingmechanismsareappliedacrossdesign,architecture,andprocesstosavedynamicpower:

n MultipleVdd:

– Staticvoltagescaling.InanSoC,differentblockscanworkon differentvoltages,andthelowerthevoltageofablock,theless poweritislikelytoconsume;therefore,itisimperativetocreate multiplevoltagedomains.Tosupportthis,typicallyvoltage regulatorsareusedtocreatedifferentsuppliesfromonesource supply.IPsoperatingononeparticularvoltagewillbeputinthe respectivevoltageisland.So,in Figure2.3,forexample,theIPs operatingon1.2VwillsitonVoltageIsland-1,whiletheIPs operatingon1.8VwillsitonVoltageIsland-3,andsoon.In Figure2.3,theCPUisshowntobeplacedina1.2-Visland, graphicsandtouchina1.8-Visland,audioina1.5V,andeMMC ina1.9-Visland.Thesevoltagelevelsandtheseparationarejust forthesakeofillustration.Theexactnumberofrails/islands dependsonthedesign,IPsbeingused,andtheintendedusage modelofthesystem.

– DVFS.DVFSisatechniqueusedtooptimizepowerconsumptionin differingworkloadscenarios.Inotherwords,theIPisdesignedin suchawaythatitdoesnotconsumefixedpowerallthetime; instead,thepowerconsumptiondependsontheperformancelevel theIPisoperatingon.So,inheavyworkloadscenarios,theIP willbeoperatingonahigherperformancemodeandthereby consuminghigherpower,whileinlighterworkloadscenarios,the IPwillbeoperatingonlowerperformancemodeandthereby consuminglowerpower.Toimplementthis,intheIPdesign,

n FIGURE2.3 Systemwithmultiple Vdd

variousperformancemodesarecreated.Eachoftheperformance modeshasanassociatedoperatingfrequencyandeachofthe operatingfrequencieshasanassociatedvoltagerequirement.So, dependingontheworkload(andtherebytheperformance requirement),thesystemsoftwarecanchoosetheoperatingmode. WhatthisfundamentallymeansisthataparticularIPiscapableof runningonmultiplefrequenciesonrespectivevoltages,andthe softwarewillchoosetherightfrequencyandvoltage,atthepoint. Ageneraldesignmaylooklike Figure2.4.Asshowninthe figure,thesoftwarewillchoosetherightmodeviamodecontrol, whichwilltranslatetorespectivevoltageandfrequencysetting.

n Adaptivevoltagescaling(AVS).AfurtherextensiontoDVFSisAVS, whereinthemodecontrollermonitorsthestate/performance requirementoftheblockandtunesthevoltage/frequencyofthe block.Inthisdesigntheneedforsoftwarecontrolgoesawayand thereforeafinercontrolonDVFSispossible.Agenericblock diagrammaylooklike Figure2.5

Itmustbenotedthatthemodemonitorandcontrollerunitcontinuously keepmonitoringtheIPsindifferentvoltageislandsandregulatethe voltagebasedontheminimumrequired.

n Clockgating.Sincetheclocktreeconsumessignificantpower (approximately50%ofdynamicpower)itisimportanttoreducethe powertakenbytheclocktree.Fundamentallyclockgatingmeans stoppingtheclocktoalogicblockwhentheoperationsofthatblock arenotneeded.Thisclockgatingsavesthepowerconsumedbylogic operatingoneachclockedge,powerconsumedbyflops,andthe clocktreeitself.Manyvariationsweredevisedthathavebuiltonthis basicconcept.

Voltage Island-1
n FIGURE2.4 Systemwithmultiple Vdd andDVFS.

n Frequency/voltagethrottling.Thisisoneofthevariationsofclock gating,whereintheclockisnotcompletelyshutoff,butrather, dependingontheperformancerequirementoftheclockfrequency,is adjustedtoalowervaluesuchthattheperformancerequirementis metwithminimumpowerconsumption.Sincethesupplyvoltage requirementdependsontheclockfrequency,thevoltageisalso adjustedappropriatelytoalowvalue,therebyreducingthepower consumptionevenfurther.

n Powergating.Alogicalextensiontoclockgatingispowergating,in whichthecircuitblocksnotinusearetemporarilyturnedoff.By powergatingwebringthevoltagetozerofordevicesnotinuse.For example,ifthemediaIPisonaseparatevoltagerail,thenitcanbe completelyturnedoffwhenthereisnomediaplayback.Thisismade possiblebythemultiplevoltagerailsanddomainsinthedesign. Powergatingsavestheleakagepowerinadditiontodynamicpower. Gettingthisrightrequiressignificanteffort.Therearetworeasonsfor this:

–Sincethetimeittakestobringupthedevicefrompowerofftopower onissignificantandnoticeable,collateralsneedtoaccommodatefor powerdownstateanddefinetheiroperationflowaccordingly.

–Thedevicemaynotbeabletorespondwhenpowereddown,and worse,maycauseundesirableeffectswhenaccessedinpower downstate;theblocksaccessingthepowered-downunitsshould includeamechanismtocheckwhethertheblockcanbeaccessed.

n Processimprovement.Astransistorshavedecreasedinsize,the thicknessofthesilicondioxidegatedielectrichassteadilydecreased toincreasethegatecapacitanceandtherebydrivecurrentandraise deviceperformance.Asthethicknessscalesbelow2nm,leakage

Voltage Island-1 Voltage Island-2
Voltage Island-3 Mode monitor and controller Voltage regulator
n FIGURE2.5 Systemwithadaptivevoltagescaling.

currentsduetotunnelingincreasedrastically,leadingtohighpower consumptionandreduceddevicereliability.Replacingthesilicon dioxidegatedielectricwithahigh-κ materialallowsincreasedgate capacitancewithouttheassociatedleakageeffects.

So,tosummarizetheabovediscussion:Therearevariouswaysormechanismsemployedtosavepower,andthesemechanismsdonotworkinisolation,butratherhaveinterdependencies.Thereforevariousmechanismsare tweakedandputtogethertominimizeoroptimizethepowerconsumption.

POWERCONSUMPTIONOFASYSTEM

Roughlyspeaking,systemshavetwomodeswhenpoweredon,thefirst beingtheactivemodewhenthesystemisactivelybeingused,thesecond modeofoperationbeingstandbymodewhereinthesystemisonbutis onstandbyandwaitingforinputfromtheuser.Inthestandbymode,tosave power,mostofthesystemcomponentswillbeturnedoffsincetheyareidle. Toeffectivelymanagethepowerandstatetransition,theAdvancedConfigurationandPowerInterface(ACPI)standarddefinesvarioussystemstates anddevicestatesindetail.Generallyspeaking,thedevice/IPisnonfunctionalinlowpowerstates.Inordertousethedevice/IPagain,oneneeds tobringthedevice/IPbacktoafunctionalstatefromthelowpowernonfunctionalstate.Thetimetakenintheprocessiscalledwake-uplatency.Again,a generalruleofthumbis,thelowerthepowerstate,thelongerittakesto bringthedevice/IPtofullyfunctionalstate(themorethewake-uplatency).

So,speakingofthepowerconsumedbyasystem,asshownin Figure2.6,the totalpowerconsumedisasummationofactivemodepowerconsumption, standby(sleep)modepowerconsumption,andthewake-uppower.Inthe figurethe x-axisrepresentstime,whilethe y-axisrepresentsthepowerconsumedattime x.Wake-uppowerrepresentsthepowerwastedduringwakeup.Inanutshell,therearethreecategoriesofpowerconsumption,andseparatestrategiesareappliedtooptimizeeachoftheminasystem:

1. Powerconsumptioninactivemode

2. Powerconsumptioninstandbymode

3. Powerwastageduringsystemwake

Poweroptimizationatthesystemlevel

Whilediscussingpoweroptimizationatthesystemlevel,wewilldiscussthe optimizationonthethreefronts:activepowermanagement(APM),idle powermanagement,andconnectedstandbypowermanagement.

Turn static files into dynamic content formats.

Create a flipbook
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.