ontologiesoscommerce
oscommerce 时间:2021-04-12 阅读:(
)
QALM:aBenchmarkforQuestionAnsweringoverLinkedMerchantWebsitesDataAmineHallili1,ElenaCabrio2,3,andCatherineFaronZucker11Univ.
NiceSophiaAntipolis,CNRS,I3S,UMR7271,SophiaAntipolis,Franceamine.
hallili@inria.
fr;faron@unice.
fr2INRIASophiaAntipolisMediterranee,SophiaAntipolis,Franceelena.
cabrio@inria.
fr3EURECOM,SophiaAntipolis,FranceAbstract.
Thispaperpresentsabenchmarkfortrainingandevaluat-ingQuestionAnsweringSystemsaimingatmediatingbetweenauser,expressinghisorherinformationneedsinnaturallanguage,andseman-ticdatainthecommercialdomainofthemobilephonesindustry.
WerstdescribetheRDFdatasetweextractedthroughtheAPIsofmer-chantwebsites,andtheschemasonwhichitrelies.
Wethenpresentthemethodologyweappliedtocreateasetofnaturallanguagequestionsexpressingpossibleuserneedsintheabovementioneddomain.
Suchquestionsethasthenbeenfurtherannotatedbothwiththecorrespond-ingSPARQLqueries,andwiththecorrectanswersretrievedfromthedataset.
1IntroductionTheevolutionofthee-commercedomain,especiallytheBusinessToClient(B2C),hasencouragedtheimplementationandtheuseofdedicatedapplica-tions(e.
g.
QuestionAnsweringSystems)tryingtoprovideend-userswithabet-terexperience.
Atthesametime,theuser'sneedsaregettingmoreandmorecomplexandspecic,especiallywhenitcomestocommercialproductswhosequestionsconcernmoreoftentheirtechnicalaspects(e.
g.
price,color,seller,etc.
).
Severalsystemsareproposingsolutionstoanswertotheseneeds,butmanychal-lengeshavenotbeenovercomeyet,leavingroomforimprovement.
Forinstance,federatingseveralcommercialknowledgebasesinoneknowledgebasehasnotbeenaccomplishedyet.
Also,understandingandinterpretingcomplexnaturallanguagequestionsalsoknownasn-relationquestionsseemstobeoneoftheambitioustopicsthatsystemsarecurrentlytryingtogureout.
InthispaperwepresentabenchmarkfortrainingandevaluatingQuestionAnswering(QA)Systemsaimingatmediatingbetweenauser,expressinghisorherinformationneedinnaturallanguage,andsemanticdatainthecommercialdomainofthemobilephoneindustry.
WerstdescribetheRDFdatasetthatwehaveextractedthroughtheAPIsofmerchantsites,andtheschemasonwhichitrelies.
Wethenpresentthemethodologyweappliedtocreateasetofnaturallan-guagequestionsexpressingpossibleuserneedsintheabovementioneddomain.
SuchquestionsethasthenbefurtherannotatedbothwiththecorrespondingSPARQLqueries,andwiththecorrectanswersretrievedfromthedataset.
2AMerchantSitesDatasetfortheMobilePhonesIndustryThissectiondescribestheQALM(QuestionAnsweringoverLinkedMerchantwebsites)ontology(Section2.
1),andtheRDFdataset(Section2.
2)webuiltbyextractingasampleofdatafromasetofcommercialwebsites.
2.
1QALMOntologyTheQALMRDFdatasetreliesontwoontologies:theMerchantSiteOntology(MSO)andthePhoneOntology(PO).
TogethertheybuilduptheQALMOn-tology.
4MSOmodelsgeneralconceptsofmerchantwebsites,anditisalignedtothecommercialpartoftheSchema.
orgontology.
MSOiscomposedof5classes:mso:Product,mso:Seller,mso:Organization,mso:Store,mso:ParcelDelive-ry,andof29properties(e.
g.
mso:price,mso:url,mso:location,mso:seller)declaredassubclassesandsubpropertiesofSchema.
orgclassesandproperties.
Weaddedtothemmultilinguallabels(bothinEnglishandinFrench),thatcanbeexploitedbyQAsystemsinparticularforpropertyidenticationinthequestioninterpretationstep.
WereliedonWordNetsynonyms[2]toextractasmuchlabelsaspossible.
Forexample,thepropertymso:pricehasthefollowingEnglishlabels:"price","cost","value","tari","amount",andthefollowingFrenchlabels:"prix","cout","couter","valoir","tarif","s'elever".
POisadomainontologymodelingconceptsspecictothephoneindus-try.
Itiscomposedof7classes(e.
g.
po:Phone,po:Accessory)whicharede-claredassubclassesofmso:Product,andof35properties(e.
g.
po:handsetType,po:operatingSystem,po:phoneStyle).
2.
2QALMRDFDatasetOurnalgoalistobuildauniedRDFdatasetintegratingcommercialproductdescriptionsfromvariouse-commercewebsites.
Inordertoachievethisgoal,weanalyzethewebservicesofthee-commercewebsitesregardlessoftheirtype(eitherSOAPorREST).
Tofeedourdataset,wecreateamappingbetweentheremotecallstothewebservicesandtheontologyproperties,thatwestoreinaseparateleforreuse.
Inparticular,webuilttheQALMRDFdatasetbyextractingdatafromeBay5andBestBuy6commercialwebsitesthroughBestBuyWebserviceandeBayAPI.
TheextractedrawdataistransformedintoRDFtriplesbyapplyingtheabovedescribedmappingbetweentheQALMontology4Availableatwww.
i3s.
unice.
fr/qalm/ontology5http://www.
ebay.
com/6http://www.
bestbuy.
com/andtheAPI/webservice.
Forinstance,themethodgetPrice()intheeBayAPIismappedtothepropertymso:priceintheQALMontology.
Currently,theQALMdatasetcomprises500000productdescriptionsandupto15millionstriplesextractedfromeBayandBestBuy.
73QALMQuestionSetInordertotrainandtoevaluateaQAsystemmediatingbetweenauserandsemanticdataintheQALMdataset,asetofquestionsrepresentingusersre-questsinthephoneindustrydomainisrequired.
Uptoourknowledge,theonlyavailablestandardsetsofquestionstoevaluateQAsystemsoverlinkeddataaretheonesreleasedbytheorganizersoftheQALD(QuestionAnsweringoverLinkedData)challenges.
8HoweversuchquestionsareovertheEnglishDBpediadataset9,andthereforecoverseveraltopics.
Forthisreason,wecreatedasetofnaturallanguagequestionsforthespeciccommercialdomainofthephoneindustry,followingtheguidelinesdescribedbytheQALDorganizersforthecreationoftheirquestionsets[1].
Morespecically,thesequestionswerecre-atedby12externalpeople(studentsandresearchersinothergroups)withnobackgroundinquestionanswering,inordertoavoidabiastowardsaparticularapproach.
Toaccomplishthetaskofquestioncreation,eachpersonwasgiveni)thelistoftheproducttypespresentintheQALMdataset(mainlycomposedofITproductsasphonesandaccessories);ii)thelistofthepropertiesoftheQALMontologypresentedasproductfeaturesinwhichtheycouldbeinterestedin;andtheywereaskedtoproducei)both1-relationand2-relationquestions,andii)atleast5questionseach.
Thequestionsweredesignedtopresentpotentialuserquestionsandtoincludeawiderangeofchallengessuchaslexicalambiguitiesandcomplexsyntacticalstructures.
SuchquestionswerethenannotatedwiththecorrespondingSPARQLqueries,andthecorrectanswersretrievedfromthedataset,inordertoconsiderthemasareliablegoldstandardforourbenchmark.
Thenalquestionsetcomprises70questions;itisdividedintoatrainingset10andatestsetofrespectively40and30questions.
AnnotationsareprovidedinXMLformat,andaccordingtoQALDguidelines,thefollowingattributesarespeciedforeachquestionalongwithitsID:aggregation(indicateswhetheranyoperationbeyondtriplepatternmatchingisrequiredtoanswerthequestion,e.
g.
,counting,ltering,ordering),answertype(givestheanswertype:resource,string,boolean,double,date).
Wealsoaddedtheattributerelations,toindicatewhetherthequestionisconnectedtoitsanswerthroughoneormorepropertiesoftheontology(values:1,n).
Finally,foreachquestionthecorrespondingSPARQLqueryisprovided,aswellastheanswersthisqueryreturns.
Examples1and2showsomequestionsfromthecollectedquestionset,connectedtotheiranswersthrough1propertyormorethan1propertyoftheontology,respectively.
In7Availableatwww.
i3s.
unice.
fr/QALM/qalm.
rdf8http://greententacle.
techfak.
uni-bielefeld.
de/~cunger/qald/9http://dbpedia.
org10Availableatwww.
i3s.
unice.
fr/QALM/training_questions.
xmlparticular,questions14and50fromExample2requirealsotocarryoutsomereasoningontheresults,inordertorankthemandtoproducethecorrectanswer.
Example1.
1-relationquestions.
id=36.
Givemethemanufacturerswhosupplyon-earheadphones.
id=52.
WhatcolorsareavailablefortheSamsungGalaxy5id=61.
WhichproductsofAlcatelareavailableonlineExample2.
n-relationsquestions.
id=14.
Whichcellphonecase(anymanufacturer)hasthemostratingsid=50.
WhatisthehighestcameraresolutionofphonesmanufacturedbyMotorolaid=58.
IwouldliketoknowinwhichstoresIcanbuyApplephones.
4ConclusionsandOngoingWorkThispaperpresentedabenchmarktotrainandtestQAsystems,composedofi)theQALMontologies;ii)theQALMRDFdatasetofproductdescriptionsex-tractedfromeBayandBestBuy;andiii)theQALMQuestionSet,containing70naturallanguagequestionsinthecommercialdomainofphonesandaccessories.
Asforfuturework,wewillconsideraligningtheQALMontologytotheGoodRelationsontologytofullycoverthecommercialdomain,andtobenetfromthesemanticscapturedinthisontology.
WealsoconsiderimprovingtheQALMRDFdatasetbyi)extractingRDFdatafromadditionalcommercialwebsitesthatprovidewebservicesorAPIs;andii)directlyextractingRDFdataintheSchema.
orgontologyfromcommercialwebsiteswhosepagesareautomaticallygeneratedwithSchema.
orgmarkup(e.
g.
Magento,OSCommerce,Genesis2.
0,Prestashop),toextendthenumberofaddressedcommercialwebsites.
Inparallel,wearecurrentlydevelopingtheSynchroBotQAsystem[3],anontology-basedchatbotforthee-commercedomain.
WewillevaluateitbyusingtheproposedQALMbenchmark.
AcknowledgementsWethankAmazon,eBayandBestBuyforcontributingtothisworkbysharingwithuspublicdataabouttheircommercialproducts.
TheworkofE.
CabriowasfundedbytheFrenchGovernmentthroughtheANR-11-LABX-0031-01program.
References1.
Cimiano,P.
,Lopez,V.
,Unger,C.
,Cabrio,E.
,Ngomo,A.
C.
N.
,Walter,S.
:Multi-lingualquestionansweringoverlinkeddata(qald-3):Laboverview.
In:CLEF.
pp.
321–332(2013)2.
Fellbaum,C.
:WordNet:AnElectronicLexicalDatabase.
BradfordBooks(1998)3.
Hallili,A.
:Towardanontology-basedchatbotendowedwithnaturallanguagepro-cessingandgeneration.
In:Proc.
ofESSLLI2014-StudentSession,Posterpaper(2014)
justhost怎么样?justhost服务器好不好?JustHost是一家成立于2006年的俄罗斯服务器提供商,支持支付宝付款,服务器价格便宜,200Mbps大带宽不限流量,支持免费更换5次IP,支持控制面板自由切换机房,目前JustHost有俄罗斯6个机房可以自由切换选择,最重要的还是价格真的特别便宜,最低只需要87卢布/月,约8.5元/月起!总体来说,性价比很高,性价比不错,有需要的朋友可以...
青果云香港CN2_GIA主机测评青果云香港多线BGP网络,接入电信CN2 GIA等优质链路,测试IP:45.251.136.1青果网络QG.NET是一家高效多云管理服务商,拥有工信部颁发的全网云计算/CDN/IDC/ISP/IP-VPN等多项资质,是CNNIC/APNIC联盟的成员之一。青果云香港CN2_GIA主机性能分享下面和大家分享下。官方网站:点击进入CPU内存系统盘数据盘宽带ip价格购买地...
RAKsmart 虽然是美国主机商,但是商家的主要客户群还是在我们国内,于是我们可以看到每次的国内节日促销活动期间商家也会发布促销。包括这次年中大促活动,RAKsmart商家也有发布为期两个月的年终活动,其中有商家擅长的独立服务器和便宜VPS主机。服务器包括站群服务器、特价服务器、高达10G带宽不限制流量的美国服务器。商家优惠活动,可以看到对应商品的优惠,同时也可以使用 优惠码 RAKBL9 同时...
oscommerce为你推荐
全国企业信息查询全国企业信用信息公示系统查询入口 及操作说明哪里有?ym.163.com网易163企业邮箱的foxmail怎样设置?波音737起飞爆胎客机起飞的时候时速是多少?360防火墙在哪里怎么查找到360防火墙在自己电脑里的位置?并且关闭掉tplink01cuteftp商务软件EDI软件 包括那些软件?论坛勋章请教论坛勋章怎么做?站长统计如何给网站添加CNZZ站长统计网站日志iis日志详解,网站日志中的每一个数据代表什么超级用户在电脑上如何设置超级用户(Administrator)?
国外服务器租用 万网域名解析 申请免费域名 5折 singlehop wavecom 谷歌香港 外国空间 免费网站监控 网站被封 ibox官网 圣诞促销 微信收钱 新家坡 免费智能解析 电信主机 爱奇艺会员免费试用 申请网页 数据库空间 游戏服务器出租 更多