ontologiesoscommerce

oscommerce  时间:2021-04-12  阅读:()
QALM:aBenchmarkforQuestionAnsweringoverLinkedMerchantWebsitesDataAmineHallili1,ElenaCabrio2,3,andCatherineFaronZucker11Univ.
NiceSophiaAntipolis,CNRS,I3S,UMR7271,SophiaAntipolis,Franceamine.
hallili@inria.
fr;faron@unice.
fr2INRIASophiaAntipolisMediterranee,SophiaAntipolis,Franceelena.
cabrio@inria.
fr3EURECOM,SophiaAntipolis,FranceAbstract.
Thispaperpresentsabenchmarkfortrainingandevaluat-ingQuestionAnsweringSystemsaimingatmediatingbetweenauser,expressinghisorherinformationneedsinnaturallanguage,andseman-ticdatainthecommercialdomainofthemobilephonesindustry.
WerstdescribetheRDFdatasetweextractedthroughtheAPIsofmer-chantwebsites,andtheschemasonwhichitrelies.
Wethenpresentthemethodologyweappliedtocreateasetofnaturallanguagequestionsexpressingpossibleuserneedsintheabovementioneddomain.
Suchquestionsethasthenbeenfurtherannotatedbothwiththecorrespond-ingSPARQLqueries,andwiththecorrectanswersretrievedfromthedataset.
1IntroductionTheevolutionofthee-commercedomain,especiallytheBusinessToClient(B2C),hasencouragedtheimplementationandtheuseofdedicatedapplica-tions(e.
g.
QuestionAnsweringSystems)tryingtoprovideend-userswithabet-terexperience.
Atthesametime,theuser'sneedsaregettingmoreandmorecomplexandspecic,especiallywhenitcomestocommercialproductswhosequestionsconcernmoreoftentheirtechnicalaspects(e.
g.
price,color,seller,etc.
).
Severalsystemsareproposingsolutionstoanswertotheseneeds,butmanychal-lengeshavenotbeenovercomeyet,leavingroomforimprovement.
Forinstance,federatingseveralcommercialknowledgebasesinoneknowledgebasehasnotbeenaccomplishedyet.
Also,understandingandinterpretingcomplexnaturallanguagequestionsalsoknownasn-relationquestionsseemstobeoneoftheambitioustopicsthatsystemsarecurrentlytryingtogureout.
InthispaperwepresentabenchmarkfortrainingandevaluatingQuestionAnswering(QA)Systemsaimingatmediatingbetweenauser,expressinghisorherinformationneedinnaturallanguage,andsemanticdatainthecommercialdomainofthemobilephoneindustry.
WerstdescribetheRDFdatasetthatwehaveextractedthroughtheAPIsofmerchantsites,andtheschemasonwhichitrelies.
Wethenpresentthemethodologyweappliedtocreateasetofnaturallan-guagequestionsexpressingpossibleuserneedsintheabovementioneddomain.
SuchquestionsethasthenbefurtherannotatedbothwiththecorrespondingSPARQLqueries,andwiththecorrectanswersretrievedfromthedataset.
2AMerchantSitesDatasetfortheMobilePhonesIndustryThissectiondescribestheQALM(QuestionAnsweringoverLinkedMerchantwebsites)ontology(Section2.
1),andtheRDFdataset(Section2.
2)webuiltbyextractingasampleofdatafromasetofcommercialwebsites.
2.
1QALMOntologyTheQALMRDFdatasetreliesontwoontologies:theMerchantSiteOntology(MSO)andthePhoneOntology(PO).
TogethertheybuilduptheQALMOn-tology.
4MSOmodelsgeneralconceptsofmerchantwebsites,anditisalignedtothecommercialpartoftheSchema.
orgontology.
MSOiscomposedof5classes:mso:Product,mso:Seller,mso:Organization,mso:Store,mso:ParcelDelive-ry,andof29properties(e.
g.
mso:price,mso:url,mso:location,mso:seller)declaredassubclassesandsubpropertiesofSchema.
orgclassesandproperties.
Weaddedtothemmultilinguallabels(bothinEnglishandinFrench),thatcanbeexploitedbyQAsystemsinparticularforpropertyidenticationinthequestioninterpretationstep.
WereliedonWordNetsynonyms[2]toextractasmuchlabelsaspossible.
Forexample,thepropertymso:pricehasthefollowingEnglishlabels:"price","cost","value","tari","amount",andthefollowingFrenchlabels:"prix","cout","couter","valoir","tarif","s'elever".
POisadomainontologymodelingconceptsspecictothephoneindus-try.
Itiscomposedof7classes(e.
g.
po:Phone,po:Accessory)whicharede-claredassubclassesofmso:Product,andof35properties(e.
g.
po:handsetType,po:operatingSystem,po:phoneStyle).
2.
2QALMRDFDatasetOurnalgoalistobuildauniedRDFdatasetintegratingcommercialproductdescriptionsfromvariouse-commercewebsites.
Inordertoachievethisgoal,weanalyzethewebservicesofthee-commercewebsitesregardlessoftheirtype(eitherSOAPorREST).
Tofeedourdataset,wecreateamappingbetweentheremotecallstothewebservicesandtheontologyproperties,thatwestoreinaseparateleforreuse.
Inparticular,webuilttheQALMRDFdatasetbyextractingdatafromeBay5andBestBuy6commercialwebsitesthroughBestBuyWebserviceandeBayAPI.
TheextractedrawdataistransformedintoRDFtriplesbyapplyingtheabovedescribedmappingbetweentheQALMontology4Availableatwww.
i3s.
unice.
fr/qalm/ontology5http://www.
ebay.
com/6http://www.
bestbuy.
com/andtheAPI/webservice.
Forinstance,themethodgetPrice()intheeBayAPIismappedtothepropertymso:priceintheQALMontology.
Currently,theQALMdatasetcomprises500000productdescriptionsandupto15millionstriplesextractedfromeBayandBestBuy.
73QALMQuestionSetInordertotrainandtoevaluateaQAsystemmediatingbetweenauserandsemanticdataintheQALMdataset,asetofquestionsrepresentingusersre-questsinthephoneindustrydomainisrequired.
Uptoourknowledge,theonlyavailablestandardsetsofquestionstoevaluateQAsystemsoverlinkeddataaretheonesreleasedbytheorganizersoftheQALD(QuestionAnsweringoverLinkedData)challenges.
8HoweversuchquestionsareovertheEnglishDBpediadataset9,andthereforecoverseveraltopics.
Forthisreason,wecreatedasetofnaturallanguagequestionsforthespeciccommercialdomainofthephoneindustry,followingtheguidelinesdescribedbytheQALDorganizersforthecreationoftheirquestionsets[1].
Morespecically,thesequestionswerecre-atedby12externalpeople(studentsandresearchersinothergroups)withnobackgroundinquestionanswering,inordertoavoidabiastowardsaparticularapproach.
Toaccomplishthetaskofquestioncreation,eachpersonwasgiveni)thelistoftheproducttypespresentintheQALMdataset(mainlycomposedofITproductsasphonesandaccessories);ii)thelistofthepropertiesoftheQALMontologypresentedasproductfeaturesinwhichtheycouldbeinterestedin;andtheywereaskedtoproducei)both1-relationand2-relationquestions,andii)atleast5questionseach.
Thequestionsweredesignedtopresentpotentialuserquestionsandtoincludeawiderangeofchallengessuchaslexicalambiguitiesandcomplexsyntacticalstructures.
SuchquestionswerethenannotatedwiththecorrespondingSPARQLqueries,andthecorrectanswersretrievedfromthedataset,inordertoconsiderthemasareliablegoldstandardforourbenchmark.
Thenalquestionsetcomprises70questions;itisdividedintoatrainingset10andatestsetofrespectively40and30questions.
AnnotationsareprovidedinXMLformat,andaccordingtoQALDguidelines,thefollowingattributesarespeciedforeachquestionalongwithitsID:aggregation(indicateswhetheranyoperationbeyondtriplepatternmatchingisrequiredtoanswerthequestion,e.
g.
,counting,ltering,ordering),answertype(givestheanswertype:resource,string,boolean,double,date).
Wealsoaddedtheattributerelations,toindicatewhetherthequestionisconnectedtoitsanswerthroughoneormorepropertiesoftheontology(values:1,n).
Finally,foreachquestionthecorrespondingSPARQLqueryisprovided,aswellastheanswersthisqueryreturns.
Examples1and2showsomequestionsfromthecollectedquestionset,connectedtotheiranswersthrough1propertyormorethan1propertyoftheontology,respectively.
In7Availableatwww.
i3s.
unice.
fr/QALM/qalm.
rdf8http://greententacle.
techfak.
uni-bielefeld.
de/~cunger/qald/9http://dbpedia.
org10Availableatwww.
i3s.
unice.
fr/QALM/training_questions.
xmlparticular,questions14and50fromExample2requirealsotocarryoutsomereasoningontheresults,inordertorankthemandtoproducethecorrectanswer.
Example1.
1-relationquestions.
id=36.
Givemethemanufacturerswhosupplyon-earheadphones.
id=52.
WhatcolorsareavailablefortheSamsungGalaxy5id=61.
WhichproductsofAlcatelareavailableonlineExample2.
n-relationsquestions.
id=14.
Whichcellphonecase(anymanufacturer)hasthemostratingsid=50.
WhatisthehighestcameraresolutionofphonesmanufacturedbyMotorolaid=58.
IwouldliketoknowinwhichstoresIcanbuyApplephones.
4ConclusionsandOngoingWorkThispaperpresentedabenchmarktotrainandtestQAsystems,composedofi)theQALMontologies;ii)theQALMRDFdatasetofproductdescriptionsex-tractedfromeBayandBestBuy;andiii)theQALMQuestionSet,containing70naturallanguagequestionsinthecommercialdomainofphonesandaccessories.
Asforfuturework,wewillconsideraligningtheQALMontologytotheGoodRelationsontologytofullycoverthecommercialdomain,andtobenetfromthesemanticscapturedinthisontology.
WealsoconsiderimprovingtheQALMRDFdatasetbyi)extractingRDFdatafromadditionalcommercialwebsitesthatprovidewebservicesorAPIs;andii)directlyextractingRDFdataintheSchema.
orgontologyfromcommercialwebsiteswhosepagesareautomaticallygeneratedwithSchema.
orgmarkup(e.
g.
Magento,OSCommerce,Genesis2.
0,Prestashop),toextendthenumberofaddressedcommercialwebsites.
Inparallel,wearecurrentlydevelopingtheSynchroBotQAsystem[3],anontology-basedchatbotforthee-commercedomain.
WewillevaluateitbyusingtheproposedQALMbenchmark.
AcknowledgementsWethankAmazon,eBayandBestBuyforcontributingtothisworkbysharingwithuspublicdataabouttheircommercialproducts.
TheworkofE.
CabriowasfundedbytheFrenchGovernmentthroughtheANR-11-LABX-0031-01program.
References1.
Cimiano,P.
,Lopez,V.
,Unger,C.
,Cabrio,E.
,Ngomo,A.
C.
N.
,Walter,S.
:Multi-lingualquestionansweringoverlinkeddata(qald-3):Laboverview.
In:CLEF.
pp.
321–332(2013)2.
Fellbaum,C.
:WordNet:AnElectronicLexicalDatabase.
BradfordBooks(1998)3.
Hallili,A.
:Towardanontology-basedchatbotendowedwithnaturallanguagepro-cessingandgeneration.
In:Proc.
ofESSLLI2014-StudentSession,Posterpaper(2014)

可抵御99%的攻击中国单域版cdn:9元/月7T防御 cloudsecre

官方网站:点击访问CDN客服QQ:123008公司名:贵州青辞赋文化传媒有限公司域名和IP被墙封了怎么办?用cloudsecre.com网站被攻击了怎么办?用cloudsecre.com问:黑客为什么要找网站来攻击?答:黑客需要找肉鸡。问:什么是肉鸡?答:被控的服务器和电脑主机就是肉鸡。问:肉鸡有什么作用?答:肉鸡的作用非常多,可以用来干违法的事情,通常的行为有:VPN拨号,流量P2P,攻击傀儡,...

日本美国站群服务器raksmart站群新增,限量低至月1.99美元

RAKsmart 商家八月份的促销活动今天更新。基本上和上个月的产品套餐活动差不多的,不过也是有简单的微调。对于RAKsmart商家还是比较了解的,他们家产品虽然这两年增加多个机房,以及在VPS主机方案上有丰富的机房和调整到一些自营机房,他们家的策划能力还是有限,基本上每个月的套餐活动都差不多。RAKsmart 在八月份看到有新增香港高防服务器可选,最高100GB防御。同时原来上个月缺货的日本独立...

819云互联(800元/月),香港BGP E5 2650 16G,日本 E5 2650 16G

819云互联 在本月发布了一个购买香港,日本独立服务器的活动,相对之前的首月活动性价比更高,最多只能享受1个月的活动 续费价格恢复原价 是有些颇高 这次819云互联与机房是合作伙伴 本次拿到机房 活动7天内购买独立服务器后期的长期续费价格 加大力度 确实来说这次的就可以买年付或者更长时间了…本次是5个机房可供选择,独立服务器最低默认是50M带宽,不限制流量,。官网:https://ww...

oscommerce为你推荐
proceduresns支持ipad波音737起飞爆胎飞机会爆胎的吗?传奇域名谁能帮忙查查传奇的IP和域名powerbydedecms如何去掉织梦网站底部的powered by dedecms方法discuz教程Discuz怎样使用?引擎收录要让搜索引擎收录一张图片需要达到什么条件 ?joomla教程php100视频教程超级用户在电脑上如何设置超级用户(Administrator)?搜索引擎优化教程seo关键词优化教程 seo搜索优化教程
韩国vps俄罗斯美女 查询ip地址 sugarhosts diahosting pccw godaddy域名转出 光棍节日志 轻博 好看qq空间 服务器硬件防火墙 免费私人服务器 永久免费空间 阿里云邮箱登陆 hostease 此网页包含的内容将不使用安全的https 德国代理ip ddos攻击器下载 免费网络电视软件 最好的空间留言 杭州机动车摇号申请网站 更多