OracleDatabase:BenchmarkingaTrillionEdgesRDFGraphOracleWhitePaperJune,2020Copyright2020,Oracleand/oritsaffiliatesPublic2WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicPURPOSESTATEMENTThisdocumentprovidesanoverviewofhowOracleconductedaonetrillionedgesLUBMbenchmark(LUBM4400k)inSeptember2014withRDFGraphinOracleDatabaseonanOracleExadataDatabaseMachine.
DISCLAIMERThisdocumentinanyform,softwareorprintedmatter,containsproprietaryinformationthatistheexclusivepropertyofOracle.
YouraccesstoanduseofthisconfidentialmaterialissubjecttothetermsandconditionsofyourOraclesoftwarelicenseandserviceagreement,whichhasbeenexecutedandwithwhichyouagreetocomply.
Thisdocumentandinformationcontainedhereinmaynotbedisclosed,copied,reproducedordistributedtoanyoneoutsideOraclewithoutpriorwrittenconsentofOracle.
ThisdocumentisnotpartofyourlicenseagreementnorcanitbeincorporatedintoanycontractualagreementwithOracleoritssubsidiariesoraffiliates.
Thisdocumentisforinformationalpurposesonlyandisintendedsolelytoassistyouinplanningfortheimplementationandupgradeoftheproductfeaturesdescribed.
Itisnotacommitmenttodeliveranymaterial,code,orfunctionality,andshouldnotberelieduponinmakingpurchasingdecisions.
Thedevelopment,release,andtimingofanyfeaturesorfunctionalitydescribedinthisdocumentremainsatthesolediscretionofOracle.
Duetothenatureoftheproductarchitecture,itmaynotbepossibletosafelyincludeallfeaturesdescribedinthisdocumentwithoutriskingsignificantdestabilizationofthecode.
3WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicTABLEOFCONTENTSPurposeStatement2Disclaimer2Introduction4ATrillionEdgesRDFGraphBenchmarkonOracleDatabase5TheResults5TheConfiguration5BestPracticesUsed6Databasesettings:6Tuning:6Conclusion64WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicINTRODUCTIONOnetrillionisareallybignumber.
Whatcouldyoustorewithonetrillionfacts1000tweetsforeveryoneofthe1BillionTwitterusers.
770factsabouteveryoneofthe1.
3BillionFacebookusers.
10factsfrom107Billionsensors,locatedsomewhereontheplanet.
400metabolicreadingsforeachofthe2.
5Billionheartbeatsoveranaveragehumanlifetime.
12factsabouteveryoneofthe86Billionneuronsinthehumanbrain.
5factsabouteveryoneofthe200BillionstarsintheMilkyWayGalaxy.
7factsabouteveryoneofthe150Billiongalaxiesintheuniverse.
6,350factsabouteachofthe158MillionbooksintheLibraryofCongress,thelargestintheworld.
10factsabouteachofthe107BillionpeoplewhoeverlivedResourceDescriptionFramework(RDF)graphsandtheanalyticstheypermitarebecomingcentraltobigdataapplicationsforsocialnetworksandlinkeddata.
Theseapplicationsareoftenfoundinpublicsector,healthcareandlifesciences,finance,media,andintelligencecommunities.
TheWorldWideWebConsortium(W3C)1definesRDFandtheWebOntologyLanguage(OWL)graphstandardsforrepresentinganddefiningsemanticdataandrules,andSPARQL,apatternmatchingquerylanguagedesignedspecificallyforgraphanalysis.
ThebasicnatureofanRDFgraphfacilitatesidentification,integration,anddiscovery:RDFdataelementsaregloballyunique.
TheyaredefinedusingUniformResourceIdentifiers(URIs)thatenableaconsistentmetadatalayerforintegrationofdisparatedatasources.
RDFdataelementsarelinkedtoformagraph.
Elementsareusedtomakestatementsintheformofsubject-predicate-objecttriples.
Predicates(edges)linkthesubjectandobject(nodes)andcandescribeanyrelationshiporproperty.
Theobjectcanbeanothersubjecttolinktriplestogethertoformagraphoraliteralthatisanattributeofthesubject.
Thetriplescanbefurtherqualifiedwithafourthnamedgraphcomponent,whicharereferredtoasRDFquads.
TheRDFmodelallowseasy,dynamicschemaevolution.
Addinganewschemaelementisaseasyasinsertingatriplewithanewpredicate.
RDFandSPARQLsupportadhocqueries.
Queriesmaynotbeknownwhentheschemaisdesigned.
TheRDFmodelmakesanOpenWorldAssumptionthatcanfacilitatediscovery.
Itassumesthatwhatisunknownisundefined,ratherthanfalse,asisthecasewithrelationaltechnology.
Italsohastechnologiesthathelpdiscovermissingresults.
RDFembedssemantics(meaning)directlyinthedata.
Entitiesarecategorizedwithclasses,predicatesarepropertiesorrelationships,andtheyareallpartofthedata,unlikecolumnheaders,foreignkeys,orconstraintsinrelationaldata.
RDFsupportsmachine-driveninferencingfordiscovery.
TheOWLsemanticlanguageandrulesusedtodefinethepredicatesintriplesarebasedonformalDescriptionLogicsthatenableautomaticdiscovery,suchasidentifying"same-as"relationshipsbetweendifferenttermswiththesamemeaningintwoapplications.
Thesetofinferredtriples(conclusionsthatcanbedrawn)isreferredtoasanentailment.
TheOWLlanguagecanunifyanenterprise'sdictionaries,vocabularies,andtaxonomies.
Allofthetermsusedbytheapplicationsinanenterprisecanberelatedtoeachotherandformconcepts.
Conceptsaremanagedasoneormoredomain-specificontologiesandstoredinRDFgraphs.
Ontologiesarelinkedtotheassertedinstancedataingraphsandusedforinferencingandquerying.
Thisisanothercapabilitythatfacilitatescreatingaconsistentmetadatalayerfordataintegration.
1http://www.
w3.
org/RDF/5WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicTheLehighUniversityBenchmark2(LUBM)isadefactoindustrystandardbenchmarkforevaluatingRDFgraphstoreproductperformance.
ItisusedbyRDFgraphstorevendorstocharacterizetheload,inference,andqueryperformanceoftheirproduct.
VendorspostresultsontheW3CLargeTripleStorespage3.
EndusersuseLUBMbenchmarkresultsaspartoftheirevaluationofanRDFGraphstoreproduct.
ThebenchmarkincludesaW3COWL-baseduniversityontology,adatageneratortocreateagraphofanysize,andfourteentestqueries.
OracleconductedaonetrillionedgesLUBMbenchmark(LUBM4400k)inSeptember2014withOracleDatabase12.
1.
0.
1standardinstallationonExadataDatabaseMachineandachievedtworecord-settingaccomplishments:OraclebelievesitsbenchmarkisthelargestcompleteLUBMbenchmarkintheindustrytodate.
Thecombinedload,inference,andqueryresultsarethefastestRDFgraphperformancenumbersreported;thisisespeciallysignificantforabenchmarkofthisscaleandcomplexity.
Thedetailsforthisbenchmark,includingresults,configuration,andbestpracticesarediscussedinthenextsectionofthispaper.
ATRILLIONEDGESRDFGRAPHBENCHMARKONORACLEDATABASEAsbigdatagraphsgrowfrombillionstotrillionsofrelationshipsitbecomesincreasinglyimportanttocharacterizeproductperformance.
OracleconductedanRDFgraphLUBM4400kbenchmark.
Itinvolvedloading,inferencing,andqueryingoveronetrillionedgeswithRDFGraphinOracleDatabaseonanOracleExadataDatabaseMachine.
TheLUBMenvironmentwasusedtogeneratedataaboutuniversitiesandtheirdepartments.
Thedatawascreatedandorderedinto4.
4millionnamedgraphsbyexpandingthetriplesintoquads.
Therewasonenamedgraphperuniversity.
Theoverallgraphincluded605.
4billionuniqueassertedquadsandanentailmentofanother475.
6+billionquads.
TheResultsTheRDFGraphLUBM4400kbenchmarkonOracleDatabaseachievedthefollowingresults:DataLoadingPerformance:1.
420millionQuadsLoadedandIndexedperSecond.
605.
4BillionQuadswereloadedandtwoindexeswerecreatedin115.
2hours.
Note:GraphloadinginOracleDatabaseisuniqueintheindustryforcheckingthatquadsarewellformedandforremovingduplicates.
InferencePerformance:1.
527millionTriplesInferredandIndexedperSecond.
475.
6BillionTriplesandtwoindexeswerecreatedin86.
5hours.
SPARQLQueryPerformance:1.
130MillionQueryResultsperSecond.
92.
5BillionAnswersweregeneratedin22.
5hours.
ATrilliontriplesgraphAssertedInferredTotalAnswers605.
4BillionQuads475.
6BillionTriples1.
081TrillionQuads92.
5BillionTheConfigurationThemarket-leadingperformanceofthisbenchmarkwasduetothecombinationofthenativeRDFgraphstorecapabilitiesofRDFGraphinOracleDatabaseonthebalancedconfigurationofanOracleExadataDatabaseMachineX4-2.
TheuniquecapabilitiesoftheExadataDatabaseMachinethatassistedbenchmarkqueryperformanceinclude:SmartScanthatreducesdatamovementbetweenstorageservers(cells)anddatabaseserverbypushingqueriesdowntothestoragecell,storageindexesusedbythestoragecelltoreadonlyregionsofstoragethathaverelevantdata,andInfiniBandfabricthatprovidesfasttransfer(40Gb/second)ofrelevantbytesbacktothedatabaseservertocompletetheexecutionofaquery.
2http://swat.
cse.
lehigh.
edu/projects/lubm/3https://www.
w3.
org/wiki/LargeTripleStores6WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicTheOracleExadataDatabaseMachineX4-2Highcapacityfullrackwasconfiguredasfollows:8databasenodesand14storagenodesforatotalof168CPUcores2TBtotalRAMand44.
8TBFlashCacheZS3-2storagewith2controllersand8traysofdisksSoftware:OracleDatabase12.
1.
0.
1standardinstallationonExadataDatabaseMachine.
BestPracticesUsedThebestpracticesfallintotwocategories,databasesettingsandtuning.
Databasesettings:SGA_TARGET=132GBPGA_AGGREGATE_TARGET=100GOpencursors=1000Processes=100032KblocksizegiventoallgraphtablespacesaTEMPgroupcreatedwith3bigfiletablespacesUseoftheauto-allocateoptionforallocationoftablespaceextentscoupledwithalarge,8millionbytesextentsize.
ThisreducedthenumberofwaitscausedbyHVenqueuecontention;thatis,waitsonalockthatisusedtoalterthehigh-watermarkinatablespace.
Asaresult,contentionamongmultipleprocessesrequestingtablespaceexpansioncouldbeavoided.
DOPsettings(296,256,192)forautomaticdegreesofparallelismusedinloading,inferencing,andquerying.
UseofadditionalcompressionbeyondbasictablecompressionduringinferencingprovidedbytheHybridColumnarCompressionfeatureofOracleExadataDatabaseMachine.
Tuning:OracleEnterpriseManagerprovidedspecificperformanceinsightsintooperationsfortuning.
ThemethodologyusedisdocumentedintheOracleDatabasePerformanceTuningGuide.
4CONCLUSIONRDFgraphsprovideunique,standards-based,bigdatacapabilitiesformetadataintegration,anddiscoverytosupportsocialnetworksandlinkeddataapplicationsinavarietyofindustries.
RDFGraphdemonstratedindustry-leadingscalabilityandperformanceforloading,inference,andqueryingaonetrillionedgesRDFgraphmanagedinOracleDatabase.
TheLUBM4400kRDFgraphbenchmarkbenefitedfromthebalancedhardwareconfigurationofanOracleExadataDatabaseMachineX4-2.
Thebestpracticessettingsusedtoachievethesebenchmarkresultsarealsogenerallyapplicabletoreal-worldapplicationsonOracleExadataDatabaseMachineandotherbalancedhardwareconfigurations.
4http://docs.
oracle.
com/database/121/TGDBA/toc.
htm7WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicCONNECTWITHUSCall+1.
800.
ORACLE1orvisitoracle.
com.
OutsideNorthAmerica,findyourlocalofficeatoracle.
com/contact.
blogs.
oracle.
comfacebook.
com/oracletwitter.
com/oracleCopyright2020,Oracleand/oritsaffiliates.
Allrightsreserved.
Thisdocumentisprovidedforinformationpurposesonly,andthecontentshereofaresubjecttochangewithoutnotice.
Thisdocumentisnotwarrantedtobeerror-free,norsubjecttoanyotherwarrantiesorconditions,whetherexpressedorallyorimpliedinlaw,includingimpliedwarrantiesandconditionsofmerchantabilityorfitnessforaparticularpurpose.
Wespecificallydisclaimanyliabilitywithrespecttothisdocument,andnocontractualobligationsareformedeitherdirectlyorindirectlybythisdocument.
Thisdocumentmaynotbereproducedortransmittedinanyformorbyanymeans,electronicormechanical,foranypurpose,withoutourpriorwrittenpermission.
ThisdevicehasnotbeenauthorizedasrequiredbytherulesoftheFederalCommunicationsCommission.
Thisdeviceisnot,andmaynotbe,offeredforsaleorlease,orsoldorleased,untilauthorizationisobtained.
OracleandJavaareregisteredtrademarksofOracleand/oritsaffiliates.
Othernamesmaybetrademarksoftheirrespectiveowners.
IntelandIntelXeonaretrademarksorregisteredtrademarksofIntelCorporation.
AllSPARCtrademarksareusedunderlicenseandaretrademarksorregisteredtrademarksofSPARCInternational,Inc.
AMD,Opteron,theAMDlogo,andtheAMDOpteronlogoaretrademarksorregisteredtrademarksofAdvancedMicroDevices.
UNIXisaregisteredtrademarkofTheOpenGroup.
0120OracleDatabase:BenchmarkingaTrillionEdgesRDFGraphJune2020
弘速云是创建于2021年的品牌,运营该品牌的公司HOSU LIMITED(中文名称弘速科技有限公司)公司成立于2021年国内公司注册于2019年。HOSU LIMITED主要从事出售香港VPS、美国VPS、香港独立服务器、香港站群服务器等,目前在售VPS线路有CN2+BGP、CN2 GIA,该公司旗下产品均采用KVM虚拟化架构。可联系商家代安装iso系统。国庆活动 优惠码:hosu10-1产品介绍...
ZJI原名维翔主机,是原来Wordpress圈知名主机商家,成立于2011年,2018年9月更名为ZJI,提供香港、日本、美国独立服务器(自营/数据中心直营)租用及VDS、虚拟主机空间、域名注册业务。ZJI今年全新上架了台湾CN2线路服务器,本月针对香港高主频服务器和台湾CN2服务器提供7折优惠码,其他机房及产品提供8折优惠码,优惠后台湾CN2线路E5服务器月付595元起。台湾一型CPU:Inte...
Digital-VM商家的暑期活动促销,这个商家提供有多个数据中心独立服务器、VPS主机产品。最低配置月付80美元,支持带宽、流量和IP的自定义配置。Digital-VM,是2019年新成立的商家,主要从事日本东京、新加坡、美国洛杉矶、荷兰阿姆斯特丹、西班牙马德里、挪威奥斯陆、丹麦哥本哈根数据中心的KVM架构VPS产品销售,分为大硬盘型(1Gbps带宽端口、分配较大的硬盘)和大带宽型(10Gbps...
oracle数据库视频为你推荐
急救知识纳入考试应急救护知识应该由哪个部门培训12306崩溃12306网站显示异常,什么原因啊7788k.comwww.k6320.com 大家给我看看这网站是真是假...嘀动网在炫动网买鞋怎么样,是真的吗同一ip网站最近我们网站老是出现同一个IP无数次的进我们网站,而且是在同一时刻,是不是被人刷了?为什么呀?同一ip网站如何用不同的IP同时登陆一个网站杨丽晓博客明星的最新博文www.kaspersky.com.cn现在网上又有病毒了?ww.66bobo.com有的网址直接输入***.com就行了,不用WWW, 为什么?yinrentangzimotang氨基酸洗发水的功效咋样?
长沙虚拟主机 themeforest 私人服务器 softbank官网 天互数据 100m空间 ntfs格式分区 双线主机 hktv gtt 国外ip加速器 web服务器是什么 卡巴斯基试用版下载 北京主机托管 湖南铁通 sonya 内存 装修瓦工培训 kosskeb79 最年轻博士 更多