OracleDatabase:BenchmarkingaTrillionEdgesRDFGraphOracleWhitePaperJune,2020Copyright2020,Oracleand/oritsaffiliatesPublic2WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicPURPOSESTATEMENTThisdocumentprovidesanoverviewofhowOracleconductedaonetrillionedgesLUBMbenchmark(LUBM4400k)inSeptember2014withRDFGraphinOracleDatabaseonanOracleExadataDatabaseMachine.
DISCLAIMERThisdocumentinanyform,softwareorprintedmatter,containsproprietaryinformationthatistheexclusivepropertyofOracle.
YouraccesstoanduseofthisconfidentialmaterialissubjecttothetermsandconditionsofyourOraclesoftwarelicenseandserviceagreement,whichhasbeenexecutedandwithwhichyouagreetocomply.
Thisdocumentandinformationcontainedhereinmaynotbedisclosed,copied,reproducedordistributedtoanyoneoutsideOraclewithoutpriorwrittenconsentofOracle.
ThisdocumentisnotpartofyourlicenseagreementnorcanitbeincorporatedintoanycontractualagreementwithOracleoritssubsidiariesoraffiliates.
Thisdocumentisforinformationalpurposesonlyandisintendedsolelytoassistyouinplanningfortheimplementationandupgradeoftheproductfeaturesdescribed.
Itisnotacommitmenttodeliveranymaterial,code,orfunctionality,andshouldnotberelieduponinmakingpurchasingdecisions.
Thedevelopment,release,andtimingofanyfeaturesorfunctionalitydescribedinthisdocumentremainsatthesolediscretionofOracle.
Duetothenatureoftheproductarchitecture,itmaynotbepossibletosafelyincludeallfeaturesdescribedinthisdocumentwithoutriskingsignificantdestabilizationofthecode.
3WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicTABLEOFCONTENTSPurposeStatement2Disclaimer2Introduction4ATrillionEdgesRDFGraphBenchmarkonOracleDatabase5TheResults5TheConfiguration5BestPracticesUsed6Databasesettings:6Tuning:6Conclusion64WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicINTRODUCTIONOnetrillionisareallybignumber.
Whatcouldyoustorewithonetrillionfacts1000tweetsforeveryoneofthe1BillionTwitterusers.
770factsabouteveryoneofthe1.
3BillionFacebookusers.
10factsfrom107Billionsensors,locatedsomewhereontheplanet.
400metabolicreadingsforeachofthe2.
5Billionheartbeatsoveranaveragehumanlifetime.
12factsabouteveryoneofthe86Billionneuronsinthehumanbrain.
5factsabouteveryoneofthe200BillionstarsintheMilkyWayGalaxy.
7factsabouteveryoneofthe150Billiongalaxiesintheuniverse.
6,350factsabouteachofthe158MillionbooksintheLibraryofCongress,thelargestintheworld.
10factsabouteachofthe107BillionpeoplewhoeverlivedResourceDescriptionFramework(RDF)graphsandtheanalyticstheypermitarebecomingcentraltobigdataapplicationsforsocialnetworksandlinkeddata.
Theseapplicationsareoftenfoundinpublicsector,healthcareandlifesciences,finance,media,andintelligencecommunities.
TheWorldWideWebConsortium(W3C)1definesRDFandtheWebOntologyLanguage(OWL)graphstandardsforrepresentinganddefiningsemanticdataandrules,andSPARQL,apatternmatchingquerylanguagedesignedspecificallyforgraphanalysis.
ThebasicnatureofanRDFgraphfacilitatesidentification,integration,anddiscovery:RDFdataelementsaregloballyunique.
TheyaredefinedusingUniformResourceIdentifiers(URIs)thatenableaconsistentmetadatalayerforintegrationofdisparatedatasources.
RDFdataelementsarelinkedtoformagraph.
Elementsareusedtomakestatementsintheformofsubject-predicate-objecttriples.
Predicates(edges)linkthesubjectandobject(nodes)andcandescribeanyrelationshiporproperty.
Theobjectcanbeanothersubjecttolinktriplestogethertoformagraphoraliteralthatisanattributeofthesubject.
Thetriplescanbefurtherqualifiedwithafourthnamedgraphcomponent,whicharereferredtoasRDFquads.
TheRDFmodelallowseasy,dynamicschemaevolution.
Addinganewschemaelementisaseasyasinsertingatriplewithanewpredicate.
RDFandSPARQLsupportadhocqueries.
Queriesmaynotbeknownwhentheschemaisdesigned.
TheRDFmodelmakesanOpenWorldAssumptionthatcanfacilitatediscovery.
Itassumesthatwhatisunknownisundefined,ratherthanfalse,asisthecasewithrelationaltechnology.
Italsohastechnologiesthathelpdiscovermissingresults.
RDFembedssemantics(meaning)directlyinthedata.
Entitiesarecategorizedwithclasses,predicatesarepropertiesorrelationships,andtheyareallpartofthedata,unlikecolumnheaders,foreignkeys,orconstraintsinrelationaldata.
RDFsupportsmachine-driveninferencingfordiscovery.
TheOWLsemanticlanguageandrulesusedtodefinethepredicatesintriplesarebasedonformalDescriptionLogicsthatenableautomaticdiscovery,suchasidentifying"same-as"relationshipsbetweendifferenttermswiththesamemeaningintwoapplications.
Thesetofinferredtriples(conclusionsthatcanbedrawn)isreferredtoasanentailment.
TheOWLlanguagecanunifyanenterprise'sdictionaries,vocabularies,andtaxonomies.
Allofthetermsusedbytheapplicationsinanenterprisecanberelatedtoeachotherandformconcepts.
Conceptsaremanagedasoneormoredomain-specificontologiesandstoredinRDFgraphs.
Ontologiesarelinkedtotheassertedinstancedataingraphsandusedforinferencingandquerying.
Thisisanothercapabilitythatfacilitatescreatingaconsistentmetadatalayerfordataintegration.
1http://www.
w3.
org/RDF/5WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicTheLehighUniversityBenchmark2(LUBM)isadefactoindustrystandardbenchmarkforevaluatingRDFgraphstoreproductperformance.
ItisusedbyRDFgraphstorevendorstocharacterizetheload,inference,andqueryperformanceoftheirproduct.
VendorspostresultsontheW3CLargeTripleStorespage3.
EndusersuseLUBMbenchmarkresultsaspartoftheirevaluationofanRDFGraphstoreproduct.
ThebenchmarkincludesaW3COWL-baseduniversityontology,adatageneratortocreateagraphofanysize,andfourteentestqueries.
OracleconductedaonetrillionedgesLUBMbenchmark(LUBM4400k)inSeptember2014withOracleDatabase12.
1.
0.
1standardinstallationonExadataDatabaseMachineandachievedtworecord-settingaccomplishments:OraclebelievesitsbenchmarkisthelargestcompleteLUBMbenchmarkintheindustrytodate.
Thecombinedload,inference,andqueryresultsarethefastestRDFgraphperformancenumbersreported;thisisespeciallysignificantforabenchmarkofthisscaleandcomplexity.
Thedetailsforthisbenchmark,includingresults,configuration,andbestpracticesarediscussedinthenextsectionofthispaper.
ATRILLIONEDGESRDFGRAPHBENCHMARKONORACLEDATABASEAsbigdatagraphsgrowfrombillionstotrillionsofrelationshipsitbecomesincreasinglyimportanttocharacterizeproductperformance.
OracleconductedanRDFgraphLUBM4400kbenchmark.
Itinvolvedloading,inferencing,andqueryingoveronetrillionedgeswithRDFGraphinOracleDatabaseonanOracleExadataDatabaseMachine.
TheLUBMenvironmentwasusedtogeneratedataaboutuniversitiesandtheirdepartments.
Thedatawascreatedandorderedinto4.
4millionnamedgraphsbyexpandingthetriplesintoquads.
Therewasonenamedgraphperuniversity.
Theoverallgraphincluded605.
4billionuniqueassertedquadsandanentailmentofanother475.
6+billionquads.
TheResultsTheRDFGraphLUBM4400kbenchmarkonOracleDatabaseachievedthefollowingresults:DataLoadingPerformance:1.
420millionQuadsLoadedandIndexedperSecond.
605.
4BillionQuadswereloadedandtwoindexeswerecreatedin115.
2hours.
Note:GraphloadinginOracleDatabaseisuniqueintheindustryforcheckingthatquadsarewellformedandforremovingduplicates.
InferencePerformance:1.
527millionTriplesInferredandIndexedperSecond.
475.
6BillionTriplesandtwoindexeswerecreatedin86.
5hours.
SPARQLQueryPerformance:1.
130MillionQueryResultsperSecond.
92.
5BillionAnswersweregeneratedin22.
5hours.
ATrilliontriplesgraphAssertedInferredTotalAnswers605.
4BillionQuads475.
6BillionTriples1.
081TrillionQuads92.
5BillionTheConfigurationThemarket-leadingperformanceofthisbenchmarkwasduetothecombinationofthenativeRDFgraphstorecapabilitiesofRDFGraphinOracleDatabaseonthebalancedconfigurationofanOracleExadataDatabaseMachineX4-2.
TheuniquecapabilitiesoftheExadataDatabaseMachinethatassistedbenchmarkqueryperformanceinclude:SmartScanthatreducesdatamovementbetweenstorageservers(cells)anddatabaseserverbypushingqueriesdowntothestoragecell,storageindexesusedbythestoragecelltoreadonlyregionsofstoragethathaverelevantdata,andInfiniBandfabricthatprovidesfasttransfer(40Gb/second)ofrelevantbytesbacktothedatabaseservertocompletetheexecutionofaquery.
2http://swat.
cse.
lehigh.
edu/projects/lubm/3https://www.
w3.
org/wiki/LargeTripleStores6WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicTheOracleExadataDatabaseMachineX4-2Highcapacityfullrackwasconfiguredasfollows:8databasenodesand14storagenodesforatotalof168CPUcores2TBtotalRAMand44.
8TBFlashCacheZS3-2storagewith2controllersand8traysofdisksSoftware:OracleDatabase12.
1.
0.
1standardinstallationonExadataDatabaseMachine.
BestPracticesUsedThebestpracticesfallintotwocategories,databasesettingsandtuning.
Databasesettings:SGA_TARGET=132GBPGA_AGGREGATE_TARGET=100GOpencursors=1000Processes=100032KblocksizegiventoallgraphtablespacesaTEMPgroupcreatedwith3bigfiletablespacesUseoftheauto-allocateoptionforallocationoftablespaceextentscoupledwithalarge,8millionbytesextentsize.
ThisreducedthenumberofwaitscausedbyHVenqueuecontention;thatis,waitsonalockthatisusedtoalterthehigh-watermarkinatablespace.
Asaresult,contentionamongmultipleprocessesrequestingtablespaceexpansioncouldbeavoided.
DOPsettings(296,256,192)forautomaticdegreesofparallelismusedinloading,inferencing,andquerying.
UseofadditionalcompressionbeyondbasictablecompressionduringinferencingprovidedbytheHybridColumnarCompressionfeatureofOracleExadataDatabaseMachine.
Tuning:OracleEnterpriseManagerprovidedspecificperformanceinsightsintooperationsfortuning.
ThemethodologyusedisdocumentedintheOracleDatabasePerformanceTuningGuide.
4CONCLUSIONRDFgraphsprovideunique,standards-based,bigdatacapabilitiesformetadataintegration,anddiscoverytosupportsocialnetworksandlinkeddataapplicationsinavarietyofindustries.
RDFGraphdemonstratedindustry-leadingscalabilityandperformanceforloading,inference,andqueryingaonetrillionedgesRDFgraphmanagedinOracleDatabase.
TheLUBM4400kRDFgraphbenchmarkbenefitedfromthebalancedhardwareconfigurationofanOracleExadataDatabaseMachineX4-2.
Thebestpracticessettingsusedtoachievethesebenchmarkresultsarealsogenerallyapplicabletoreal-worldapplicationsonOracleExadataDatabaseMachineandotherbalancedhardwareconfigurations.
4http://docs.
oracle.
com/database/121/TGDBA/toc.
htm7WHITEPAPER|OracleDatabase:BenchmarkingATrillionEdgesRDFGraphCopyright2020,Oracleand/oritsaffiliates|PublicCONNECTWITHUSCall+1.
800.
ORACLE1orvisitoracle.
com.
OutsideNorthAmerica,findyourlocalofficeatoracle.
com/contact.
blogs.
oracle.
comfacebook.
com/oracletwitter.
com/oracleCopyright2020,Oracleand/oritsaffiliates.
Allrightsreserved.
Thisdocumentisprovidedforinformationpurposesonly,andthecontentshereofaresubjecttochangewithoutnotice.
Thisdocumentisnotwarrantedtobeerror-free,norsubjecttoanyotherwarrantiesorconditions,whetherexpressedorallyorimpliedinlaw,includingimpliedwarrantiesandconditionsofmerchantabilityorfitnessforaparticularpurpose.
Wespecificallydisclaimanyliabilitywithrespecttothisdocument,andnocontractualobligationsareformedeitherdirectlyorindirectlybythisdocument.
Thisdocumentmaynotbereproducedortransmittedinanyformorbyanymeans,electronicormechanical,foranypurpose,withoutourpriorwrittenpermission.
ThisdevicehasnotbeenauthorizedasrequiredbytherulesoftheFederalCommunicationsCommission.
Thisdeviceisnot,andmaynotbe,offeredforsaleorlease,orsoldorleased,untilauthorizationisobtained.
OracleandJavaareregisteredtrademarksofOracleand/oritsaffiliates.
Othernamesmaybetrademarksoftheirrespectiveowners.
IntelandIntelXeonaretrademarksorregisteredtrademarksofIntelCorporation.
AllSPARCtrademarksareusedunderlicenseandaretrademarksorregisteredtrademarksofSPARCInternational,Inc.
AMD,Opteron,theAMDlogo,andtheAMDOpteronlogoaretrademarksorregisteredtrademarksofAdvancedMicroDevices.
UNIXisaregisteredtrademarkofTheOpenGroup.
0120OracleDatabase:BenchmarkingaTrillionEdgesRDFGraphJune2020
VPSMS最近在做两周年活动,加上双十一也不久了,商家针对美国洛杉矶CN2 GIA线路VPS主机提供月付6.8折,季付6.2折优惠码,同时活动期间充值800元送150元。这是一家由港人和国人合资开办的VPS主机商,提供基于KVM架构的VPS主机,美国洛杉矶安畅的机器,线路方面电信联通CN2 GIA,移动直连,国内访问速度不错。下面分享几款VPS主机配置信息。CPU:1core内存:512MB硬盘:...
感恩一年有你!免费领取2核4G套餐!2核4G轻量应用服务器2核 CPU 4GB内存 60G SSD云硬盘 6Mbps带宽领取地址:https://cloud.tencent.com/act/pro/lighthousethankyou活动规则活动时间2021年9月23日 ~ 2021年10月23日活动对象腾讯云官网已注册且完成实名认证的国内站用户(协作者与子用户账号除外),且符合以下活动条件:账号...
racknerd怎么样?racknerd最近发布了一些便宜美国服务器促销,包括大硬盘服务器,提供120G SSD+192TB HDD,有AMD和Intel两个选择,默认32G内存,1Gbps带宽,每个月100TB流量,5个IP地址,月付$599。价格非常便宜,需要存储服务器的朋友可以关注一下。RackNerd主要经营美国圣何塞、洛杉矶、达拉斯、芝加哥、亚特兰大、新泽西机房基于KVM虚拟化的VPS、...
oracle数据库视频为你推荐
同ip网站查询同ip地址站点查询 我本地怎么查询不了7788k.comwww.k6320.com 大家给我看看这网站是真是假...haole018.com为什么www.haole008.com在我这里打不开啊,是不是haole008换新的地址了?www.haole012.com012.qq.com是真的吗www.e12.com.cn上海高中除了四大名校,接下来哪所高中最好?顺便讲下它的各方面情况51sese.comwww.51xuanh.com这是什么网站是骗人的吗?朴容熙这个网诺红人叫什么1377.com真实.女友下载地址谁有百度关键字在百度 输入任何关键词,可以搜出想要的内容,但是 搜索工具栏里面的字,却始终是同一个关键词, 如图蜘蛛机器人在《红色警戒2共和国之辉》中,对付“蜘蛛机器人”的最好武器是什么?
根域名服务器 jsp主机 海外服务器 cloudstack 香港cdn 42u机柜尺寸 国外空间服务商 ixwebhosting 阿里云代金券 国外在线代理 免费网站申请 智能骨干网 宁波服务器 789电视 中国电信测网速 国外代理服务器地址 免费活动 美国网站服务器 申请网页 路由跟踪 更多