QALM: a Benchmark for Question Answering over Linked Merchant Websites Data

Amine Hallili¹, Elena Cabrio²,³, and Catherine Faron Zucker¹

¹ Univ. Nice Sophia Antipolis, CNRS, I3S, UMR 7271, Sophia Antipolis, France
amine.hallili@inria.fr; faron@unice.fr
² INRIA Sophia Antipolis Méditerranée, Sophia Antipolis, France
elena.cabrio@inria.fr
³ EURECOM, Sophia Antipolis, France

Abstract.
This paper presents a benchmark for training and evaluating Question Answering Systems that mediate between a user, expressing his or her information needs in natural language, and semantic data in the commercial domain of the mobile phone industry. We first describe the RDF dataset we extracted through the APIs of merchant websites, and the schemas on which it relies. We then present the methodology we applied to create a set of natural language questions expressing possible user needs in the above-mentioned domain. This question set has then been further annotated both with the corresponding SPARQL queries and with the correct answers retrieved from the dataset.

1 Introduction

The evolution of the e-commerce domain, especially Business-to-Client (B2C) commerce, has encouraged the implementation and use of dedicated applications (e.g. Question Answering Systems) that try to provide end-users with a better experience. At the same time, users' needs are becoming more complex and specific, especially for commercial products, where questions more often concern technical aspects (e.g. price, color, seller). Several systems propose solutions to address these needs, but many challenges have not been overcome yet, leaving room for improvement. For instance, federating several commercial knowledge bases into one knowledge base has not been accomplished yet. Also, understanding and interpreting complex natural language questions, also known as n-relation questions, remains one of the ambitious problems that systems are currently trying to solve.

In this paper we present a benchmark for training and evaluating Question Answering (QA) Systems that mediate between a user, expressing his or her information need in natural language, and semantic data in the commercial domain of the mobile phone industry. We first describe the RDF dataset that we have extracted through the APIs of merchant sites, and the schemas on which it relies. We then present the methodology we applied to create a set of natural language questions expressing possible user needs in the above-mentioned domain. This question set has then been further annotated both with the corresponding SPARQL queries and with the correct answers retrieved from the dataset.

2 A Merchant Sites Dataset for the Mobile Phones Industry

This section describes the QALM (Question Answering over Linked Merchant websites) ontology (Section 2.1), and the RDF dataset (Section 2.2) we built by extracting a sample of data from a set of commercial websites.

2.1 QALM Ontology

The QALM RDF dataset relies on two ontologies: the Merchant Site Ontology (MSO) and the Phone Ontology (PO). Together they build up the QALM Ontology.⁴ MSO models general concepts of merchant websites, and it is aligned to the commercial part of the Schema.org ontology. MSO is composed of 5 classes (mso:Product, mso:Seller, mso:Organization, mso:Store, mso:ParcelDelivery) and of 29 properties (e.g. mso:price, mso:url, mso:location, mso:seller), declared as subclasses and subproperties of Schema.org classes and properties. We added multilingual labels to them (both in English and in French), which QA systems can exploit, in particular for property identification in the question interpretation step. We relied on WordNet synonyms [2] to extract as many labels as possible. For example, the property mso:price has the following English labels: "price", "cost", "value", "tariff", "amount", and the following French labels: "prix", "coût", "coûter", "valoir", "tarif", "s'élever".

PO is a domain ontology modeling concepts specific to the phone industry. It is composed of 7 classes (e.g. po:Phone, po:Accessory), which are declared as subclasses of mso:Product, and of 35 properties (e.g. po:handsetType, po:operatingSystem, po:phoneStyle).

2.2 QALM RDF Dataset

Our final goal is to build a unified RDF dataset integrating commercial product descriptions from various e-commerce websites. In order to achieve this goal, we analyze the web services of the e-commerce websites regardless of their type (either SOAP or REST). To feed our dataset, we create a mapping between the remote calls to the web services and the ontology properties, which we store in a separate file for reuse. In particular, we built the QALM RDF dataset by extracting data from the eBay⁵ and BestBuy⁶ commercial websites through the BestBuy Web service and the eBay API. The extracted raw data is transformed into RDF triples by applying the above-described mapping between the QALM ontology and the API/web service. For instance, the method getPrice() in the eBay API is mapped to the property mso:price in the QALM ontology. Currently, the QALM dataset comprises 500000 product descriptions and up to 15 million triples extracted from eBay and BestBuy.⁷

⁴ Available at www.i3s.unice.fr/qalm/ontology
⁵ http://www.ebay.com/
⁶ http://www.bestbuy.com/
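
The mapping step described in this section can be sketched as follows. This is an illustration under stated assumptions, not the authors' extraction code: only the pair getPrice() / mso:price comes from the text, while the other accessor names, the record layout, the URI scheme and the helper record_to_triples are hypothetical.

```python
# Mapping from remote API accessors to ontology properties. Only the
# getPrice -> mso:price pair comes from the text; every other entry,
# the record layout and the URI scheme are hypothetical.
EBAY_MAPPING = {
    "getPrice": "mso:price",
    "getItemUrl": "mso:url",    # hypothetical accessor name
    "getSeller": "mso:seller",  # hypothetical accessor name
}


def record_to_triples(product_id, record, mapping, rdf_type="mso:Product"):
    """Turn one raw API record into (subject, predicate, object) triples."""
    subject = f"qalm:product/{product_id}"  # assumed URI scheme
    triples = [(subject, "rdf:type", rdf_type)]
    for accessor, value in record.items():
        prop = mapping.get(accessor)
        if prop is None or value is None:
            continue  # fields without a mapping (or without a value) are skipped
        triples.append((subject, prop, value))
    return triples


# A made-up raw record; "getCondition" has no mapping and is therefore dropped.
raw = {"getPrice": 199.99, "getSeller": "acme-store", "getCondition": "new"}
triples = record_to_triples("42", raw, EBAY_MAPPING)
```

Keeping the mapping as plain data, separate from the conversion function, mirrors the idea of storing it in a separate file so that a second mapping (for instance, one for BestBuy) can reuse the same conversion code.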

3 QALM Question Set

In order to train and to evaluate a QA system mediating between a user and semantic data in the QALM dataset, a set of questions representing users' requests in the phone industry domain is required. To our knowledge, the only available standard sets of questions to evaluate QA systems over linked data are the ones released by the organizers of the QALD (Question Answering over Linked Data) challenges.⁸ However, such questions are over the English DBpedia dataset⁹, and therefore span many topics rather than a single domain. For this reason, we created a set of natural language questions for the specific commercial domain of the phone industry, following the guidelines described by the QALD organizers for the creation of their question sets [1]. More specifically, these questions were created by 12 external people (students and researchers in other groups) with no background in question answering, in order to avoid a bias towards a particular approach. To accomplish the task of question creation, each person was given i) the list of the product types present in the QALM dataset (mainly composed of IT products such as phones and accessories); and ii) the list of the properties of the QALM ontology, presented as product features in which they could be interested. They were asked to produce i) both 1-relation and 2-relation questions, and ii) at least 5 questions each. The questions were designed to represent potential user questions and to include a wide range of challenges such as lexical ambiguities and complex syntactical structures. These questions were then annotated with the corresponding SPARQL queries and the correct answers retrieved from the dataset, so that they can serve as a reliable gold standard for our benchmark.

The final question set comprises 70 questions; it is divided into a training set¹⁰ and a test set of 40 and 30 questions, respectively.
Annotations are provided in XML format and, according to QALD guidelines, the following attributes are specified for each question along with its ID: aggregation (indicates whether any operation beyond triple pattern matching is required to answer the question, e.g., counting, filtering, ordering) and answertype (gives the answer type: resource, string, boolean, double, date). We also added the attribute relations, to indicate whether the question is connected to its answer through one or more properties of the ontology (values: 1, n). Finally, for each question the corresponding SPARQL query is provided, as well as the answers this query returns.
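
As a rough illustration of how such an annotation file might be consumed, the sketch below parses one entry with Python's standard library. The attribute names (id, aggregation, answertype, relations) follow the description above, but the element names, the sample question, its SPARQL query and its answer are all invented for this example and are not taken from the QALM files.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation entry. Attribute names follow the text; element
# names, the question, the query and the answer are illustrative assumptions.
SAMPLE = """
<dataset id="sample">
  <question id="sample-1" aggregation="false" answertype="double" relations="1">
    <string lang="en">How much does the phone cost?</string>
    <query>SELECT ?price WHERE { ?phone a po:Phone ; mso:price ?price }</query>
    <answers><answer>199.99</answer></answers>
  </question>
</dataset>
"""


def load_questions(xml_text):
    """Parse annotated questions into a list of plain dictionaries."""
    root = ET.fromstring(xml_text.strip())
    questions = []
    for q in root.iter("question"):
        questions.append({
            "id": q.get("id"),
            "aggregation": q.get("aggregation") == "true",
            "answertype": q.get("answertype"),
            "relations": q.get("relations"),  # "1" or "n"
            "text": q.findtext("string"),
            "sparql": q.findtext("query"),
            "answers": [a.text for a in q.iter("answer")],
        })
    return questions


questions = load_questions(SAMPLE)
one_relation = [q for q in questions if q["relations"] == "1"]
```

Filtering on the relations attribute, as in the last line, is one way an evaluation script could report results separately for 1-relation and n-relation questions.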
Examples 1 and 2 show some questions from the collected question set, connected to their answers through 1 property or more than 1 property of the ontology, respectively. In particular, questions 14 and 50 from Example 2 also require some reasoning over the results, in order to rank them and produce the correct answer.

⁷ Available at www.i3s.unice.fr/QALM/qalm.rdf
⁸ http://greententacle.techfak.uni-bielefeld.de/~cunger/qald/
⁹ http://dbpedia.org
¹⁰ Available at www.i3s.unice.fr/QALM/training_questions.xml

Example 1. 1-relation questions.
id=36. Give me the manufacturers who supply on-ear headphones.
id=52. What colors are available for the Samsung Galaxy 5?
id=61. Which products of Alcatel are available online?

Example 2. n-relation questions.
id=14. Which cell phone case (any manufacturer) has the most ratings?
id=50. What is the highest camera resolution of phones manufactured by Motorola?
id=58. I would like to know in which stores I can buy Apple phones.
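
The difference between a 1-relation lookup and an n-relation question that needs ranking (such as question 50) can be sketched on a toy in-memory triple list. All identifiers and values below, including the ex: properties, are invented for illustration and do not come from the QALM dataset; the SPARQL text in the comment is likewise only an approximation of what a gold-standard query could look like.

```python
# Toy triples; every identifier and value below is invented for illustration.
TRIPLES = [
    ("ex:phoneA", "ex:manufacturer", "ex:Motorola"),
    ("ex:phoneA", "ex:cameraResolution", 8.0),
    ("ex:phoneB", "ex:manufacturer", "ex:Motorola"),
    ("ex:phoneB", "ex:cameraResolution", 13.0),
    ("ex:phoneC", "ex:manufacturer", "ex:Alcatel"),
    ("ex:phoneC", "ex:cameraResolution", 16.0),
]


def subjects(predicate, obj):
    """All subjects s such that (s, predicate, obj) is in the data."""
    return [s for s, p, o in TRIPLES if p == predicate and o == obj]


def objects(subject, predicate):
    """All objects o such that (subject, predicate, o) is in the data."""
    return [o for s, p, o in TRIPLES if s == subject and p == predicate]


# 1-relation: a single property links the question entity to the answers.
motorola_phones = subjects("ex:manufacturer", "ex:Motorola")

# n-relation with aggregation: follow a second property, then rank the results.
# Roughly: SELECT (MAX(?r) AS ?max)
#          WHERE { ?p ex:manufacturer ex:Motorola ; ex:cameraResolution ?r }
resolutions = [r for phone in motorola_phones
               for r in objects(phone, "ex:cameraResolution")]
highest = max(resolutions)
```

The final max call corresponds to the extra reasoning step over the results: pattern matching alone returns every Motorola resolution, and only the aggregation selects the single correct answer.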

4 Conclusions and Ongoing Work

This paper presented a benchmark to train and test QA systems, composed of i) the QALM ontologies; ii) the QALM RDF dataset of product descriptions extracted from eBay and BestBuy; and iii) the QALM Question Set, containing 70 natural language questions in the commercial domain of phones and accessories.

As for future work, we will consider aligning the QALM ontology to the GoodRelations ontology to fully cover the commercial domain, and to benefit from the semantics captured in this ontology. We also consider improving the QALM RDF dataset by i) extracting RDF data from additional commercial websites that provide web services or APIs; and ii) directly extracting RDF data in the Schema.org ontology from commercial websites whose pages are automatically generated with Schema.org markup (e.g. Magento, OSCommerce, Genesis 2.0, Prestashop), to extend the number of addressed commercial websites. In parallel, we are currently developing the SynchroBot QA system [3], an ontology-based chatbot for the e-commerce domain. We will evaluate it by using the proposed QALM benchmark.

Acknowledgements

We thank Amazon, eBay and BestBuy for contributing to this work by sharing with us public data about their commercial products. The work of E. Cabrio was funded by the French Government through the ANR-11-LABX-0031-01 program.

References

1. Cimiano, P., Lopez, V., Unger, C., Cabrio, E., Ngomo, A.C.N., Walter, S.: Multilingual question answering over linked data (QALD-3): Lab overview. In: CLEF. pp. 321–332 (2013)
2. Fellbaum, C.: WordNet: An Electronic Lexical Database. Bradford Books (1998)
3. Hallili, A.: Toward an ontology-based chatbot endowed with natural language processing and generation. In: Proc. of ESSLLI 2014 - Student Session, Poster paper (2014)
