Oracle Database: Benchmarking a Trillion Edges RDF Graph
Oracle White Paper
June 2020
Copyright 2020, Oracle and/or its affiliates
Public

PURPOSE STATEMENT

This document provides an overview of how Oracle conducted a one trillion edges LUBM benchmark (LUBM 4400k) in September 2014 with RDF Graph in Oracle Database on an Oracle Exadata Database Machine.

DISCLAIMER

This document in any form, software or printed matter, contains proprietary information that is the exclusive property of Oracle. Your access to and use of this confidential material is subject to the terms and conditions of your Oracle software license and service agreement, which has been executed and with which you agree to comply. This document and information contained herein may not be disclosed, copied, reproduced or distributed to anyone outside Oracle without prior written consent of Oracle. This document is not part of your license agreement nor can it be incorporated into any contractual agreement with Oracle or its subsidiaries or affiliates.

This document is for informational purposes only and is intended solely to assist you in planning for the implementation and upgrade of the product features described. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described in this document remains at the sole discretion of Oracle. Due to the nature of the product architecture, it may not be possible to safely include all features described in this document without risking significant destabilization of the code.

TABLE OF CONTENTS

Purpose Statement
Disclaimer
Introduction
A Trillion Edges RDF Graph Benchmark on Oracle Database
    The Results
    The Configuration
    Best Practices Used
        Database settings:
        Tuning:
Conclusion

INTRODUCTION

One trillion is a really big number. What could you store with one trillion facts?

• 1,000 tweets for every one of the 1 billion Twitter users.
• 770 facts about every one of the 1.3 billion Facebook users.
• 10 facts from 107 billion sensors, located somewhere on the planet.
• 400 metabolic readings for each of the 2.5 billion heartbeats over an average human lifetime.
• 12 facts about every one of the 86 billion neurons in the human brain.
• 5 facts about every one of the 200 billion stars in the Milky Way Galaxy.
• 7 facts about every one of the 150 billion galaxies in the universe.
• 6,350 facts about each of the 158 million books in the Library of Congress, the largest in the world.
• 10 facts about each of the 107 billion people who ever lived.

Resource Description Framework (RDF) graphs and the analytics they permit are becoming central to big data applications for social networks and linked data. These applications are often found in public sector, health care and life sciences, finance, media, and intelligence communities. The World Wide Web Consortium (W3C)¹ defines RDF and the Web Ontology Language (OWL) graph standards for representing and defining semantic data and rules, and SPARQL, a pattern matching query language designed specifically for graph analysis.

The basic nature of an RDF graph facilitates identification, integration, and discovery:

• RDF data elements are globally unique. They are defined using Uniform Resource Identifiers (URIs) that enable a consistent metadata layer for integration of disparate data sources.
• RDF data elements are linked to form a graph. Elements are used to make statements in the form of subject-predicate-object triples. Predicates (edges) link the subject and object (nodes) and can describe any relationship or property. The object can be another subject, which links triples together to form a graph, or a literal that is an attribute of the subject. Triples can be further qualified with a fourth component, the named graph; such statements are referred to as RDF quads.
• The RDF model allows easy, dynamic schema evolution. Adding a new schema element is as easy as inserting a triple with a new predicate.
• RDF and SPARQL support ad hoc queries. Queries may not be known when the schema is designed.
• The RDF model makes an Open World Assumption that can facilitate discovery. It assumes that what is unknown is undefined, whereas relational technology treats what is unknown as false. It also has technologies that help discover missing results.
• RDF embeds semantics (meaning) directly in the data. Entities are categorized with classes, predicates are properties or relationships, and they are all part of the data, unlike column headers, foreign keys, or constraints in relational data.
• RDF supports machine-driven inferencing for discovery. The OWL semantic language and rules used to define the predicates in triples are based on formal Description Logics that enable automatic discovery, such as identifying "same-as" relationships between different terms with the same meaning in two applications. The set of inferred triples (conclusions that can be drawn) is referred to as an entailment.
• The OWL language can unify an enterprise's dictionaries, vocabularies, and taxonomies. All of the terms used by the applications in an enterprise can be related to each other and form concepts. Concepts are managed as one or more domain-specific ontologies and stored in RDF graphs. Ontologies are linked to the asserted instance data in graphs and used for inferencing and querying. This is another capability that facilitates creating a consistent metadata layer for data integration.
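
The triple and quad model described in the list above can be illustrated with a small in-memory sketch. It is not how RDF Graph in Oracle Database stores or queries data; the `Quad` type, the `match` helper, and the `http://example.org/` namespace are all hypothetical names chosen for illustration. The sketch shows statements qualified by a named graph, a wildcard pattern match in the spirit of a SPARQL triple pattern, and schema evolution by simply inserting a statement with a new predicate.

```python
from typing import NamedTuple, Optional


class Quad(NamedTuple):
    subject: str
    predicate: str
    obj: str                      # a URI (another node) or a literal value
    graph: Optional[str] = None   # named graph; None means a plain triple


EX = "http://example.org/"  # hypothetical namespace, for illustration only

# A set keeps each statement once, so duplicate inserts are ignored.
store = {
    Quad(EX + "alice", EX + "memberOf", EX + "dept1", EX + "univ0"),
    Quad(EX + "alice", EX + "name", '"Alice"', EX + "univ0"),
    Quad(EX + "dept1", EX + "subOrganizationOf", EX + "univ0", EX + "univ0"),
}


def match(quads, s=None, p=None, o=None, g=None):
    """Return quads matching a pattern; None is a wildcard, like a query variable."""
    pattern = (s, p, o, g)
    found = [
        q for q in quads
        if all(want is None or want == got for want, got in zip(pattern, q))
    ]
    return sorted(found, key=lambda q: (q.subject, q.predicate, q.obj, q.graph or ""))


# Schema evolution: a new predicate needs no schema change, only a new statement.
store.add(Quad(EX + "alice", EX + "emailAddress", '"alice@example.org"', EX + "univ0"))

about_alice = match(store, s=EX + "alice")
members = match(store, p=EX + "memberOf", o=EX + "dept1")
print(len(about_alice), [q.subject for q in members])
```

A production RDF store would index these statements and evaluate full SPARQL; the linear scan here is only meant to make the data model concrete.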

¹ http://www.w3.org/RDF/

The Lehigh University Benchmark² (LUBM) is a de facto industry standard benchmark for evaluating RDF graph store product performance. It is used by RDF graph store vendors to characterize the load, inference, and query performance of their product. Vendors post results on the W3C Large Triple Stores page³. End users use LUBM benchmark results as part of their evaluation of an RDF Graph store product. The benchmark includes a W3C OWL-based university ontology, a data generator to create a graph of any size, and fourteen test queries.

Oracle conducted a one trillion edges LUBM benchmark (LUBM 4400k) in September 2014 with Oracle Database 12.1.0.1 standard installation on Exadata Database Machine and achieved two record-setting accomplishments:

• Oracle believes its benchmark is the largest complete LUBM benchmark in the industry to date.
• The combined load, inference, and query results are the fastest RDF graph performance numbers reported; this is especially significant for a benchmark of this scale and complexity.

The details for this benchmark, including results, configuration, and best practices, are discussed in the next section of this paper.

A TRILLION EDGES RDF GRAPH BENCHMARK ON ORACLE DATABASE

As big data graphs grow from billions to trillions of relationships, it becomes increasingly important to characterize product performance. Oracle conducted an RDF graph LUBM 4400k benchmark. It involved loading, inferencing, and querying over one trillion edges with RDF Graph in Oracle Database on an Oracle Exadata Database Machine.

The LUBM environment was used to generate data about universities and their departments. The data was created and ordered into 4.4 million named graphs by expanding the triples into quads. There was one named graph per university. The overall graph included 605.4 billion unique asserted quads and an entailment of another 475.6+ billion quads.
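
To make the distinction between asserted and entailed statements concrete, the following minimal sketch forward-chains two RDFS-style rules (subclass transitivity and type propagation) over a handful of triples until no new triples appear. It is a toy under stated assumptions: the class names are hypothetical and not taken from the LUBM ontology, and the loop is not the inference engine used in the benchmark, which this paper does not describe at that level.

```python
RDF_TYPE = "rdf:type"
SUBCLASS = "rdfs:subClassOf"

# Asserted statements (hypothetical class names, for illustration only).
asserted = {
    ("ex:GraduateStudent", SUBCLASS, "ex:Student"),
    ("ex:Student", SUBCLASS, "ex:Person"),
    ("ex:alice", RDF_TYPE, "ex:GraduateStudent"),
}


def entail(triples):
    """Apply two RDFS-style rules to a fixpoint and return only the new triples."""
    closure = set(triples)
    while True:
        new = set()
        for s, p, o in closure:
            for s2, p2, o2 in closure:
                if p2 != SUBCLASS or o != s2:
                    continue
                if p == SUBCLASS:       # A subClassOf B, B subClassOf C
                    new.add((s, SUBCLASS, o2))
                elif p == RDF_TYPE:     # x type A, A subClassOf B
                    new.add((s, RDF_TYPE, o2))
        if new <= closure:              # nothing new: fixpoint reached
            break
        closure |= new
    return closure - set(triples)


inferred = entail(asserted)
print(len(asserted), "asserted,", len(inferred), "inferred")
```

Here the entailment contains three triples that were never asserted, so a query for every `ex:Person` would return `ex:alice` only once inference has been run.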

The Results

The RDF Graph LUBM 4400k benchmark on Oracle Database achieved the following results:

• Data Loading Performance: 1.420 million Quads Loaded and Indexed per Second. 605.4 Billion Quads were loaded and two indexes were created in 115.2 hours. Note: Graph loading in Oracle Database is unique in the industry for checking that quads are well formed and for removing duplicates.
• Inference Performance: 1.527 million Triples Inferred and Indexed per Second. 475.6 Billion Triples and two indexes were created in 86.5 hours.
• SPARQL Query Performance: 1.130 Million Query Results per Second. 92.5 Billion Answers were generated in 22.5 hours.
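
The per-second rates follow from dividing each total by its elapsed time. The short sketch below redoes that arithmetic from the rounded totals and durations quoted above; the dictionary layout and function name are illustrative choices. Recomputed this way, the inference rate matches the headline figure, while the load and query rates come out slightly higher (about 1.46 and 1.14 million per second), which may simply mean the headline numbers rest on unrounded timings.

```python
SECONDS_PER_HOUR = 3600

# (items processed, elapsed hours), as quoted in the results above
phases = {
    "load": (605.4e9, 115.2),
    "inference": (475.6e9, 86.5),
    "query": (92.5e9, 22.5),
}


def per_second(items, hours):
    """Average throughput in items per second over the whole phase."""
    return items / (hours * SECONDS_PER_HOUR)


rates = {name: per_second(items, hours) for name, (items, hours) in phases.items()}

# Asserted plus inferred statements give the size of the full graph.
total_statements = phases["load"][0] + phases["inference"][0]

for name, rate in rates.items():
    print(f"{name}: {rate / 1e6:.3f} million per second")
print(f"total graph: {total_statements / 1e12:.3f} trillion statements")
```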

A Trillion triples graph

Asserted               Inferred                 Total                  Answers
605.4 Billion Quads    475.6 Billion Triples    1.081 Trillion Quads   92.5 Billion

The Configuration

The market-leading performance of this benchmark was due to the combination of the native RDF graph store capabilities of RDF Graph in Oracle Database and the balanced configuration of an Oracle Exadata Database Machine X4-2. The unique capabilities of the Exadata Database Machine that assisted benchmark query performance include:

• Smart Scan, which reduces data movement between storage servers (cells) and the database server by pushing queries down to the storage cell,
• storage indexes, used by the storage cell to read only regions of storage that have relevant data, and
• InfiniBand fabric, which provides fast transfer (40 Gb/second) of relevant bytes back to the database server to complete the execution of a query.
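
The idea of reading only the regions of storage that can hold relevant data can be illustrated with a conceptual toy. The sketch assumes a per-region minimum/maximum summary, a common way to implement region skipping; this paper does not describe Exadata's internal design, so the `Region` class and `scan_equal` function are hypothetical and stand for the general technique only. Filtering inside the scan, so that only matching values are returned, loosely mirrors the goal of pushing a query down to storage.

```python
from dataclasses import dataclass


@dataclass
class Region:
    rows: list  # values of one column held in this storage region

    def __post_init__(self):
        # Summary kept alongside the region: smallest and largest value.
        self.lo, self.hi = min(self.rows), max(self.rows)


def scan_equal(regions, value):
    """Return matching values and the number of regions actually read."""
    hits, regions_read = [], 0
    for region in regions:
        if not (region.lo <= value <= region.hi):
            continue  # the summary proves this region cannot contain a match
        regions_read += 1
        hits.extend(v for v in region.rows if v == value)
    return hits, regions_read


regions = [Region([1, 4, 7]), Region([10, 12, 19]), Region([20, 25, 31])]
hits, regions_read = scan_equal(regions, 12)
print(hits, regions_read)
```

In this example two of the three regions are skipped without being read, and only the single matching value travels back to the caller.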

² http://swat.cse.lehigh.edu/projects/lubm/
³ https://www.w3.org/wiki/LargeTripleStores

The Oracle Exadata Database Machine X4-2 High Capacity full rack was configured as follows:

• 8 database nodes and 14 storage nodes for a total of 168 CPU cores
• 2 TB total RAM and 44.8 TB Flash Cache
• ZS3-2 storage with 2 controllers and 8 trays of disks
• Software: Oracle Database 12.1.0.1 standard installation on Exadata Database Machine

Best Practices Used

The best practices fall into two categories, database settings and tuning.

Database settings:

• SGA_TARGET=132GB
• PGA_AGGREGATE_TARGET=100G
• Open cursors=1000
• Processes=1000
• 32K block size given to all graph tablespaces
• A TEMP group created with 3 bigfile tablespaces
• Use of the auto-allocate option for allocation of tablespace extents coupled with a large, 8 million bytes extent size. This reduced the number of waits caused by HW enqueue contention; that is, waits on a lock that is used to alter the high-water mark in a tablespace. As a result, contention among multiple processes requesting tablespace expansion could be avoided.
• DOP settings (296, 256, 192) for automatic degrees of parallelism used in loading, inferencing, and querying.
• Use of additional compression beyond basic table compression during inferencing, provided by the Hybrid Columnar Compression feature of Oracle Exadata Database Machine.
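
As an illustration of how the four instance-level values in the list above might be applied, the sketch below renders them as `ALTER SYSTEM` statements. This is an assumption for illustration: the paper does not say how the settings were applied, the helper name is hypothetical, `132G` is the size-suffix form assumed for the 132GB figure, and `SCOPE = SPFILE` (which presumes the instance uses a server parameter file and takes effect at the next restart) is one of several possible choices.

```python
# Instance-level values from the list above; memory sizes use the G suffix.
settings = {
    "sga_target": "132G",
    "pga_aggregate_target": "100G",
    "open_cursors": 1000,
    "processes": 1000,
}


def alter_system(name, value):
    """Render one parameter change; SCOPE = SPFILE is an illustrative assumption."""
    return f"ALTER SYSTEM SET {name} = {value} SCOPE = SPFILE;"


statements = [alter_system(name, value) for name, value in settings.items()]
print("\n".join(statements))
```

The block size, TEMP group, extent allocation, parallelism, and compression settings are properties of tablespaces, sessions, or tables rather than instance parameters, so they are left out of this sketch.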

Tuning:

Oracle Enterprise Manager provided specific performance insights into operations for tuning. The methodology used is documented in the Oracle Database Performance Tuning Guide.⁴

CONCLUSION

RDF graphs provide unique, standards-based, big data capabilities for metadata integration and discovery to support social networks and linked data applications in a variety of industries. RDF Graph demonstrated industry-leading scalability and performance for loading, inference, and querying a one trillion edges RDF graph managed in Oracle Database. The LUBM 4400k RDF graph benchmark benefited from the balanced hardware configuration of an Oracle Exadata Database Machine X4-2. The best practices settings used to achieve these benchmark results are also generally applicable to real-world applications on Oracle Exadata Database Machine and other balanced hardware configurations.

⁴ http://docs.oracle.com/database/121/TGDBA/toc.htm