Oracle Database: Benchmarking a Trillion Edges RDF Graph

Oracle White Paper
June 2020

Copyright 2020, Oracle and/or its affiliates | Public

PURPOSE STATEMENT

This document provides an overview of how Oracle conducted a one trillion edges LUBM benchmark (LUBM 4400k) in September 2014 with RDF Graph in Oracle Database on an Oracle Exadata Database Machine.

DISCLAIMER

This document in any form, software or printed matter, contains proprietary information that is the exclusive property of Oracle. Your access to and use of this confidential material is subject to the terms and conditions of your Oracle software license and service agreement, which has been executed and with which you agree to comply. This document and information contained herein may not be disclosed, copied, reproduced or distributed to anyone outside Oracle without prior written consent of Oracle. This document is not part of your license agreement nor can it be incorporated into any contractual agreement with Oracle or its subsidiaries or affiliates.

This document is for informational purposes only and is intended solely to assist you in planning for the implementation and upgrade of the product features described. It is not a commitment to deliver any material, code, or functionality, and should not be relied upon in making purchasing decisions. The development, release, and timing of any features or functionality described in this document remains at the sole discretion of Oracle. Due to the nature of the product architecture, it may not be possible to safely include all features described in this document without risking significant destabilization of the code.

TABLE OF CONTENTS

Purpose Statement
Disclaimer
Introduction
A Trillion Edges RDF Graph Benchmark on Oracle Database
    The Results
    The Configuration
    Best Practices Used
        Database settings
        Tuning
Conclusion

INTRODUCTION

One trillion is a really big number. What could you store with one trillion facts?

- 1,000 tweets for every one of the 1 billion Twitter users.
- 770 facts about every one of the 1.3 billion Facebook users.
- 10 facts from 107 billion sensors, located somewhere on the planet.
- 400 metabolic readings for each of the 2.5 billion heartbeats over an average human lifetime.
- 12 facts about every one of the 86 billion neurons in the human brain.
- 5 facts about every one of the 200 billion stars in the Milky Way Galaxy.
- 7 facts about every one of the 150 billion galaxies in the universe.
- 6,350 facts about each of the 158 million books in the Library of Congress, the largest in the world.
- 10 facts about each of the 107 billion people who ever lived.

Resource Description Framework (RDF) graphs and the analytics they permit are becoming central to big data applications for social networks and linked data. These applications are often found in public sector, health care and life sciences, finance, media, and intelligence communities.

The World Wide Web Consortium (W3C) [1] defines RDF and the Web Ontology Language (OWL) graph standards for representing and defining semantic data and rules, and SPARQL, a pattern-matching query language designed specifically for graph analysis.

The basic nature of an RDF graph facilitates identification, integration, and discovery:

- RDF data elements are globally unique. They are defined using Uniform Resource Identifiers (URIs) that enable a consistent metadata layer for integration of disparate data sources.
- RDF data elements are linked to form a graph. Elements are used to make statements in the form of subject-predicate-object triples. Predicates (edges) link the subject and object (nodes) and can describe any relationship or property. The object can be another subject, which links triples together to form a graph, or a literal that is an attribute of the subject. Triples can be further qualified with a fourth component, a named graph; the resulting four-part statements are referred to as RDF quads.
- The RDF model allows easy, dynamic schema evolution. Adding a new schema element is as easy as inserting a triple with a new predicate.
- RDF and SPARQL support ad hoc queries. Queries may not be known when the schema is designed.
- The RDF model makes an Open World Assumption that can facilitate discovery. It assumes that what is unknown is undefined, rather than false as relational technology assumes. It also has technologies that help discover missing results.
- RDF embeds semantics (meaning) directly in the data. Entities are categorized with classes, and predicates are properties or relationships; all of these are part of the data, unlike column headers, foreign keys, or constraints in relational data.
- RDF supports machine-driven inferencing for discovery. The OWL semantic language and rules used to define the predicates in triples are based on formal Description Logics that enable automatic discovery, such as identifying "same-as" relationships between different terms with the same meaning in two applications. The set of inferred triples (conclusions that can be drawn) is referred to as an entailment.
- The OWL language can unify an enterprise's dictionaries, vocabularies, and taxonomies. All of the terms used by the applications in an enterprise can be related to each other and form concepts. Concepts are managed as one or more domain-specific ontologies and stored in RDF graphs. Ontologies are linked to the asserted instance data in graphs and used for inferencing and querying. This is another capability that facilitates creating a consistent metadata layer for data integration.
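
To make the triple and quad model concrete, the following is a minimal Python sketch, not Oracle's storage format or API. The `Quad` type, the `match` helper, and every `http://example.org/` URI are hypothetical names introduced only for illustration. It shows statements as subject-predicate-object records with an optional named graph, a new predicate added without any schema change, and an ad hoc pattern match in the spirit of a SPARQL basic graph pattern.

```python
from typing import Iterable, Iterator, NamedTuple, Optional


class Quad(NamedTuple):
    subject: str
    predicate: str
    obj: str                      # another node (URI) or a literal value
    graph: Optional[str] = None   # named graph; None means a plain triple


# Hypothetical example data; these URIs are made up for illustration.
EX = "http://example.org/"
data = [
    Quad(EX + "alice", EX + "memberOf", EX + "dept1", EX + "univ0"),
    Quad(EX + "dept1", EX + "subOrganizationOf", EX + "univ0", EX + "univ0"),
    Quad(EX + "alice", EX + "name", '"Alice"', EX + "univ0"),
    # Schema evolution: a new predicate is just one more statement.
    Quad(EX + "alice", EX + "nickname", '"Al"', EX + "univ0"),
]


def match(quads: Iterable[Quad], s=None, p=None, o=None, g=None) -> Iterator[Quad]:
    """Yield quads that fit the pattern; None acts as a wildcard."""
    for q in quads:
        if ((s is None or q.subject == s) and (p is None or q.predicate == p)
                and (o is None or q.obj == o) and (g is None or q.graph == g)):
            yield q


# Ad hoc query: everything stated about alice.
about_alice = list(match(data, s=EX + "alice"))

# Follow two edges: alice -> her department -> its parent organization.
dept = next(match(data, s=EX + "alice", p=EX + "memberOf")).obj
parent = next(match(data, s=dept, p=EX + "subOrganizationOf")).obj
```

Because the object of one statement (`dept1`) is the subject of another, following the two edges walks the graph without any join keys being declared in advance.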

[1] http://www.w3.org/RDF/

The Lehigh University Benchmark [2] (LUBM) is a de facto industry-standard benchmark for evaluating RDF graph store product performance. It is used by RDF graph store vendors to characterize the load, inference, and query performance of their product. Vendors post results on the W3C Large Triple Stores page [3]. End users use LUBM benchmark results as part of their evaluation of an RDF graph store product. The benchmark includes a W3C OWL-based university ontology, a data generator to create a graph of any size, and fourteen test queries.

Oracle conducted a one trillion edges LUBM benchmark (LUBM 4400k) in September 2014 with an Oracle Database 12.1.0.1 standard installation on Exadata Database Machine and achieved two record-setting accomplishments:

- Oracle believes its benchmark is the largest complete LUBM benchmark in the industry to date.
- The combined load, inference, and query results are the fastest RDF graph performance numbers reported; this is especially significant for a benchmark of this scale and complexity.

The details for this benchmark, including results, configuration, and best practices, are discussed in the next section of this paper.

A TRILLION EDGES RDF GRAPH BENCHMARK ON ORACLE DATABASE

As big data graphs grow from billions to trillions of relationships, it becomes increasingly important to characterize product performance. Oracle conducted an RDF graph LUBM 4400k benchmark. It involved loading, inferencing, and querying over one trillion edges with RDF Graph in Oracle Database on an Oracle Exadata Database Machine.

The LUBM environment was used to generate data about universities and their departments. The data was created and ordered into 4.4 million named graphs by expanding the triples into quads. There was one named graph per university. The overall graph included 605.4 billion unique asserted quads and an entailment of another 475.6+ billion quads.
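
The split between asserted and entailed statements can be illustrated with a small Python sketch. This is a toy forward-chaining loop over two RDFS-style rules, not the inference engine used in the benchmark, and the class and instance names are hypothetical rather than taken from the LUBM ontology. It derives only the new triples, so the total graph size is the asserted count plus the inferred count, just as in the figures above.

```python
from typing import Set, Tuple

Triple = Tuple[str, str, str]

# Hypothetical mini-ontology plus one instance (illustrative names only).
asserted: Set[Triple] = {
    ("GraduateStudent", "subClassOf", "Student"),
    ("Student", "subClassOf", "Person"),
    ("alice", "type", "GraduateStudent"),
}


def entail(triples: Set[Triple]) -> Set[Triple]:
    """Return only the newly derivable triples under two rules:
    (1) subClassOf is transitive; (2) instances inherit superclasses."""
    known = set(triples)
    while True:
        new: Set[Triple] = set()
        for (a, p1, b) in known:
            for (c, p2, d) in known:
                if b != c or p2 != "subClassOf":
                    continue
                if p1 == "subClassOf":
                    new.add((a, "subClassOf", d))
                elif p1 == "type":
                    new.add((a, "type", d))
        new -= known
        if not new:                      # fixpoint: nothing left to derive
            return known - set(triples)
        known |= new


inferred = entail(asserted)
total = len(asserted) + len(inferred)
```

The nested loop is quadratic per pass and is only meant to show the idea; a production engine would index the triples and evaluate the rules in parallel.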

The Results

The RDF Graph LUBM 4400k benchmark on Oracle Database achieved the following results:

- Data Loading Performance: 1.420 million quads loaded and indexed per second. 605.4 billion quads were loaded and two indexes were created in 115.2 hours. Note: Graph loading in Oracle Database is unique in the industry for checking that quads are well formed and for removing duplicates.
- Inference Performance: 1.527 million triples inferred and indexed per second. 475.6 billion triples and two indexes were created in 86.5 hours.
- SPARQL Query Performance: 1.130 million query results per second. 92.5 billion answers were generated in 22.5 hours.
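
The per-second rates above relate the item counts to the elapsed hours, and a short Python sketch can cross-check them. It uses only the rounded figures quoted in this paper; the variable and function names are illustrative. With these inputs the inference rate reproduces the reported 1.527 million per second, while the loading and query rates come out slightly higher (about 1.46 and 1.14 million per second). The paper does not state how its rates were derived, so the small gaps may reflect unrounded timings or work not captured in the quoted hours; that explanation is an assumption.

```python
SECONDS_PER_HOUR = 3600

# phase: (items processed, elapsed hours, rate per second as reported)
phases = {
    "load":      (605.4e9, 115.2, 1.420e6),
    "inference": (475.6e9, 86.5, 1.527e6),
    "query":     (92.5e9, 22.5, 1.130e6),
}


def rate_per_second(items: float, hours: float) -> float:
    """Average throughput over the whole run."""
    return items / (hours * SECONDS_PER_HOUR)


computed = {name: rate_per_second(items, hours)
            for name, (items, hours, _reported) in phases.items()}

for name, (_items, _hours, reported) in phases.items():
    diff_pct = 100 * (computed[name] - reported) / reported
    print(f"{name:9s} computed {computed[name] / 1e6:.3f} M/s, "
          f"reported {reported / 1e6:.3f} M/s ({diff_pct:+.1f}%)")

# Asserted plus inferred statements give the size of the full graph.
total_quads = 605.4e9 + 475.6e9
```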

A Trillion triples graph

Asserted               Inferred                 Total                   Answers
605.4 billion quads    475.6 billion triples    1.081 trillion quads    92.5 billion

The Configuration

The market-leading performance of this benchmark was due to the combination of the native RDF graph store capabilities of RDF Graph in Oracle Database on the balanced configuration of an Oracle Exadata Database Machine X4-2.

The unique capabilities of the Exadata Database Machine that assisted benchmark query performance include:

- Smart Scan, which reduces data movement between storage servers (cells) and the database server by pushing queries down to the storage cell;
- storage indexes, which the storage cell uses to read only the regions of storage that hold relevant data; and
- InfiniBand fabric, which provides fast transfer (40 Gb/second) of relevant bytes back to the database server to complete the execution of a query.
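
The idea of reading only the regions that can hold relevant data, and filtering at the storage layer, can be sketched generically in Python. This is not Exadata's implementation; it assumes a simple minimum/maximum summary kept per region, and the `Region` class and `scan_equal` function are hypothetical names.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Region:
    values: List[int]   # column values stored in this region

    def __post_init__(self) -> None:
        # Summary kept next to the data so the region can be skipped
        # without reading its values.
        self.lo, self.hi = min(self.values), max(self.values)


def scan_equal(regions: List[Region], target: int) -> Tuple[List[int], int]:
    """Return (matching values, number of regions actually read)."""
    hits: List[int] = []
    regions_read = 0
    for r in regions:
        if target < r.lo or target > r.hi:
            continue    # pruned: the summary proves there is no match here
        regions_read += 1
        # Filter where the data lives and return only the relevant values.
        hits.extend(v for v in r.values if v == target)
    return hits, regions_read


regions = [Region([1, 4, 7]), Region([10, 12, 15]), Region([20, 22, 12])]
hits, regions_read = scan_equal(regions, 12)
```

In this toy run the first region is skipped entirely because its range cannot contain the target, and only the matching values travel back to the caller.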

[2] http://swat.cse.lehigh.edu/projects/lubm/
[3] https://www.w3.org/wiki/LargeTripleStores

The Oracle Exadata Database Machine X4-2 High Capacity full rack was configured as follows:

- 8 database nodes and 14 storage nodes for a total of 168 CPU cores
- 2 TB total RAM and 44.8 TB Flash Cache
- ZS3-2 storage with 2 controllers and 8 trays of disks
- Software: Oracle Database 12.1.0.1 standard installation on Exadata Database Machine

Best Practices Used

The best practices fall into two categories: database settings and tuning.

Database settings:

- SGA_TARGET = 132 GB
- PGA_AGGREGATE_TARGET = 100 GB
- Open cursors = 1000
- Processes = 1000
- 32K block size given to all graph tablespaces
- A TEMP group created with 3 bigfile tablespaces
- Use of the auto-allocate option for allocation of tablespace extents, coupled with a large extent size of 8 million bytes. This reduced the number of waits caused by HV enqueue contention; that is, waits on a lock that is used to alter the high-water mark in a tablespace. As a result, contention among multiple processes requesting tablespace expansion could be avoided.
- DOP settings (296, 256, 192) for automatic degrees of parallelism used in loading, inferencing, and querying.
- Use of additional compression beyond basic table compression during inferencing, provided by the Hybrid Columnar Compression feature of Oracle Exadata Database Machine.
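
As a rough illustration of how the instance-level settings listed above might be recorded and applied, the Python sketch below renders them as `ALTER SYSTEM` statements. The paper does not say how the settings were applied, so this is an assumption. Two further assumptions: "Open cursors" and "Processes" are taken to mean the `open_cursors` and `processes` initialization parameters, and the three DOP values are taken to map to loading, inferencing, and querying in the order listed.

```python
from typing import Dict, List

# Values from the settings list; parameter names are assumed mappings.
init_params: Dict[str, str] = {
    "sga_target": "132G",
    "pga_aggregate_target": "100G",
    "open_cursors": "1000",
    "processes": "1000",
}

# Degree of parallelism per phase, assuming the paper lists them in order.
dop = dict(zip(("loading", "inferencing", "querying"), (296, 256, 192)))


def alter_system_statements(params: Dict[str, str]) -> List[str]:
    """Render one statement per parameter, written to the server
    parameter file so that it takes effect at the next restart."""
    return [f"ALTER SYSTEM SET {name}={value} SCOPE=SPFILE;"
            for name, value in params.items()]


statements = alter_system_statements(init_params)
print("\n".join(statements))
```

Keeping the values in one dictionary makes it easy to review them against the list before anything is run on a database.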

Tuning:

Oracle Enterprise Manager provided specific performance insights into operations for tuning. The methodology used is documented in the Oracle Database Performance Tuning Guide [4].

CONCLUSION

RDF graphs provide unique, standards-based, big data capabilities for metadata integration and discovery to support social networks and linked data applications in a variety of industries. RDF Graph demonstrated industry-leading scalability and performance for loading, inference, and querying a one trillion edges RDF graph managed in Oracle Database. The LUBM 4400k RDF graph benchmark benefited from the balanced hardware configuration of an Oracle Exadata Database Machine X4-2. The best-practice settings used to achieve these benchmark results are also generally applicable to real-world applications on Oracle Exadata Database Machine and other balanced hardware configurations.

[4] http://docs.oracle.com/database/121/TGDBA/toc.htm

CONNECT WITH US

Call +1.800.ORACLE1 or visit oracle.com. Outside North America, find your local office at oracle.com/contact.

blogs.oracle.com
facebook.com/oracle
twitter.com/oracle

Copyright 2020, Oracle and/or its affiliates. All rights reserved. This document is provided for information purposes only, and the contents hereof are subject to change without notice. This document is not warranted to be error-free, nor subject to any other warranties or conditions, whether expressed orally or implied in law, including implied warranties and conditions of merchantability or fitness for a particular purpose. We specifically disclaim any liability with respect to this document, and no contractual obligations are formed either directly or indirectly by this document. This document may not be reproduced or transmitted in any form or by any means, electronic or mechanical, for any purpose, without our prior written permission.

This device has not been authorized as required by the rules of the Federal Communications Commission. This device is not, and may not be, offered for sale or lease, or sold or leased, until authorization is obtained.

Oracle and Java are registered trademarks of Oracle and/or its affiliates. Other names may be trademarks of their respective owners.

Intel and Intel Xeon are trademarks or registered trademarks of Intel Corporation. All SPARC trademarks are used under license and are trademarks or registered trademarks of SPARC International, Inc. AMD, Opteron, the AMD logo, and the AMD Opteron logo are trademarks or registered trademarks of Advanced Micro Devices. UNIX is a registered trademark of The Open Group.

Oracle Database: Benchmarking a Trillion Edges RDF Graph
June 2020