ontologiesoscommerce

oscommerce  时间:2021-04-12  阅读:()
QALM:aBenchmarkforQuestionAnsweringoverLinkedMerchantWebsitesDataAmineHallili1,ElenaCabrio2,3,andCatherineFaronZucker11Univ.
NiceSophiaAntipolis,CNRS,I3S,UMR7271,SophiaAntipolis,Franceamine.
hallili@inria.
fr;faron@unice.
fr2INRIASophiaAntipolisMediterranee,SophiaAntipolis,Franceelena.
cabrio@inria.
fr3EURECOM,SophiaAntipolis,FranceAbstract.
Thispaperpresentsabenchmarkfortrainingandevaluat-ingQuestionAnsweringSystemsaimingatmediatingbetweenauser,expressinghisorherinformationneedsinnaturallanguage,andseman-ticdatainthecommercialdomainofthemobilephonesindustry.
WerstdescribetheRDFdatasetweextractedthroughtheAPIsofmer-chantwebsites,andtheschemasonwhichitrelies.
Wethenpresentthemethodologyweappliedtocreateasetofnaturallanguagequestionsexpressingpossibleuserneedsintheabovementioneddomain.
Suchquestionsethasthenbeenfurtherannotatedbothwiththecorrespond-ingSPARQLqueries,andwiththecorrectanswersretrievedfromthedataset.
1IntroductionTheevolutionofthee-commercedomain,especiallytheBusinessToClient(B2C),hasencouragedtheimplementationandtheuseofdedicatedapplica-tions(e.
g.
QuestionAnsweringSystems)tryingtoprovideend-userswithabet-terexperience.
Atthesametime,theuser'sneedsaregettingmoreandmorecomplexandspecic,especiallywhenitcomestocommercialproductswhosequestionsconcernmoreoftentheirtechnicalaspects(e.
g.
price,color,seller,etc.
).
Severalsystemsareproposingsolutionstoanswertotheseneeds,butmanychal-lengeshavenotbeenovercomeyet,leavingroomforimprovement.
Forinstance,federatingseveralcommercialknowledgebasesinoneknowledgebasehasnotbeenaccomplishedyet.
Also,understandingandinterpretingcomplexnaturallanguagequestionsalsoknownasn-relationquestionsseemstobeoneoftheambitioustopicsthatsystemsarecurrentlytryingtogureout.
InthispaperwepresentabenchmarkfortrainingandevaluatingQuestionAnswering(QA)Systemsaimingatmediatingbetweenauser,expressinghisorherinformationneedinnaturallanguage,andsemanticdatainthecommercialdomainofthemobilephoneindustry.
WerstdescribetheRDFdatasetthatwehaveextractedthroughtheAPIsofmerchantsites,andtheschemasonwhichitrelies.
Wethenpresentthemethodologyweappliedtocreateasetofnaturallan-guagequestionsexpressingpossibleuserneedsintheabovementioneddomain.
SuchquestionsethasthenbefurtherannotatedbothwiththecorrespondingSPARQLqueries,andwiththecorrectanswersretrievedfromthedataset.
2AMerchantSitesDatasetfortheMobilePhonesIndustryThissectiondescribestheQALM(QuestionAnsweringoverLinkedMerchantwebsites)ontology(Section2.
1),andtheRDFdataset(Section2.
2)webuiltbyextractingasampleofdatafromasetofcommercialwebsites.
2.
1QALMOntologyTheQALMRDFdatasetreliesontwoontologies:theMerchantSiteOntology(MSO)andthePhoneOntology(PO).
TogethertheybuilduptheQALMOn-tology.
4MSOmodelsgeneralconceptsofmerchantwebsites,anditisalignedtothecommercialpartoftheSchema.
orgontology.
MSOiscomposedof5classes:mso:Product,mso:Seller,mso:Organization,mso:Store,mso:ParcelDelive-ry,andof29properties(e.
g.
mso:price,mso:url,mso:location,mso:seller)declaredassubclassesandsubpropertiesofSchema.
orgclassesandproperties.
Weaddedtothemmultilinguallabels(bothinEnglishandinFrench),thatcanbeexploitedbyQAsystemsinparticularforpropertyidenticationinthequestioninterpretationstep.
WereliedonWordNetsynonyms[2]toextractasmuchlabelsaspossible.
Forexample,thepropertymso:pricehasthefollowingEnglishlabels:"price","cost","value","tari","amount",andthefollowingFrenchlabels:"prix","cout","couter","valoir","tarif","s'elever".
POisadomainontologymodelingconceptsspecictothephoneindus-try.
Itiscomposedof7classes(e.
g.
po:Phone,po:Accessory)whicharede-claredassubclassesofmso:Product,andof35properties(e.
g.
po:handsetType,po:operatingSystem,po:phoneStyle).
2.
2QALMRDFDatasetOurnalgoalistobuildauniedRDFdatasetintegratingcommercialproductdescriptionsfromvariouse-commercewebsites.
Inordertoachievethisgoal,weanalyzethewebservicesofthee-commercewebsitesregardlessoftheirtype(eitherSOAPorREST).
Tofeedourdataset,wecreateamappingbetweentheremotecallstothewebservicesandtheontologyproperties,thatwestoreinaseparateleforreuse.
Inparticular,webuilttheQALMRDFdatasetbyextractingdatafromeBay5andBestBuy6commercialwebsitesthroughBestBuyWebserviceandeBayAPI.
TheextractedrawdataistransformedintoRDFtriplesbyapplyingtheabovedescribedmappingbetweentheQALMontology4Availableatwww.
i3s.
unice.
fr/qalm/ontology5http://www.
ebay.
com/6http://www.
bestbuy.
com/andtheAPI/webservice.
Forinstance,themethodgetPrice()intheeBayAPIismappedtothepropertymso:priceintheQALMontology.
Currently,theQALMdatasetcomprises500000productdescriptionsandupto15millionstriplesextractedfromeBayandBestBuy.
73QALMQuestionSetInordertotrainandtoevaluateaQAsystemmediatingbetweenauserandsemanticdataintheQALMdataset,asetofquestionsrepresentingusersre-questsinthephoneindustrydomainisrequired.
Uptoourknowledge,theonlyavailablestandardsetsofquestionstoevaluateQAsystemsoverlinkeddataaretheonesreleasedbytheorganizersoftheQALD(QuestionAnsweringoverLinkedData)challenges.
8HoweversuchquestionsareovertheEnglishDBpediadataset9,andthereforecoverseveraltopics.
Forthisreason,wecreatedasetofnaturallanguagequestionsforthespeciccommercialdomainofthephoneindustry,followingtheguidelinesdescribedbytheQALDorganizersforthecreationoftheirquestionsets[1].
Morespecically,thesequestionswerecre-atedby12externalpeople(studentsandresearchersinothergroups)withnobackgroundinquestionanswering,inordertoavoidabiastowardsaparticularapproach.
Toaccomplishthetaskofquestioncreation,eachpersonwasgiveni)thelistoftheproducttypespresentintheQALMdataset(mainlycomposedofITproductsasphonesandaccessories);ii)thelistofthepropertiesoftheQALMontologypresentedasproductfeaturesinwhichtheycouldbeinterestedin;andtheywereaskedtoproducei)both1-relationand2-relationquestions,andii)atleast5questionseach.
Thequestionsweredesignedtopresentpotentialuserquestionsandtoincludeawiderangeofchallengessuchaslexicalambiguitiesandcomplexsyntacticalstructures.
SuchquestionswerethenannotatedwiththecorrespondingSPARQLqueries,andthecorrectanswersretrievedfromthedataset,inordertoconsiderthemasareliablegoldstandardforourbenchmark.
Thenalquestionsetcomprises70questions;itisdividedintoatrainingset10andatestsetofrespectively40and30questions.
AnnotationsareprovidedinXMLformat,andaccordingtoQALDguidelines,thefollowingattributesarespeciedforeachquestionalongwithitsID:aggregation(indicateswhetheranyoperationbeyondtriplepatternmatchingisrequiredtoanswerthequestion,e.
g.
,counting,ltering,ordering),answertype(givestheanswertype:resource,string,boolean,double,date).
Wealsoaddedtheattributerelations,toindicatewhetherthequestionisconnectedtoitsanswerthroughoneormorepropertiesoftheontology(values:1,n).
Finally,foreachquestionthecorrespondingSPARQLqueryisprovided,aswellastheanswersthisqueryreturns.
Examples1and2showsomequestionsfromthecollectedquestionset,connectedtotheiranswersthrough1propertyormorethan1propertyoftheontology,respectively.
In7Availableatwww.
i3s.
unice.
fr/QALM/qalm.
rdf8http://greententacle.
techfak.
uni-bielefeld.
de/~cunger/qald/9http://dbpedia.
org10Availableatwww.
i3s.
unice.
fr/QALM/training_questions.
xmlparticular,questions14and50fromExample2requirealsotocarryoutsomereasoningontheresults,inordertorankthemandtoproducethecorrectanswer.
Example1.
1-relationquestions.
id=36.
Givemethemanufacturerswhosupplyon-earheadphones.
id=52.
WhatcolorsareavailablefortheSamsungGalaxy5id=61.
WhichproductsofAlcatelareavailableonlineExample2.
n-relationsquestions.
id=14.
Whichcellphonecase(anymanufacturer)hasthemostratingsid=50.
WhatisthehighestcameraresolutionofphonesmanufacturedbyMotorolaid=58.
IwouldliketoknowinwhichstoresIcanbuyApplephones.
4ConclusionsandOngoingWorkThispaperpresentedabenchmarktotrainandtestQAsystems,composedofi)theQALMontologies;ii)theQALMRDFdatasetofproductdescriptionsex-tractedfromeBayandBestBuy;andiii)theQALMQuestionSet,containing70naturallanguagequestionsinthecommercialdomainofphonesandaccessories.
Asforfuturework,wewillconsideraligningtheQALMontologytotheGoodRelationsontologytofullycoverthecommercialdomain,andtobenetfromthesemanticscapturedinthisontology.
WealsoconsiderimprovingtheQALMRDFdatasetbyi)extractingRDFdatafromadditionalcommercialwebsitesthatprovidewebservicesorAPIs;andii)directlyextractingRDFdataintheSchema.
orgontologyfromcommercialwebsiteswhosepagesareautomaticallygeneratedwithSchema.
orgmarkup(e.
g.
Magento,OSCommerce,Genesis2.
0,Prestashop),toextendthenumberofaddressedcommercialwebsites.
Inparallel,wearecurrentlydevelopingtheSynchroBotQAsystem[3],anontology-basedchatbotforthee-commercedomain.
WewillevaluateitbyusingtheproposedQALMbenchmark.
AcknowledgementsWethankAmazon,eBayandBestBuyforcontributingtothisworkbysharingwithuspublicdataabouttheircommercialproducts.
TheworkofE.
CabriowasfundedbytheFrenchGovernmentthroughtheANR-11-LABX-0031-01program.
References1.
Cimiano,P.
,Lopez,V.
,Unger,C.
,Cabrio,E.
,Ngomo,A.
C.
N.
,Walter,S.
:Multi-lingualquestionansweringoverlinkeddata(qald-3):Laboverview.
In:CLEF.
pp.
321–332(2013)2.
Fellbaum,C.
:WordNet:AnElectronicLexicalDatabase.
BradfordBooks(1998)3.
Hallili,A.
:Towardanontology-basedchatbotendowedwithnaturallanguagepro-cessingandgeneration.
In:Proc.
ofESSLLI2014-StudentSession,Posterpaper(2014)

knownhost西雅图/亚特兰大/阿姆斯特丹$5/月,2个IP1G内存/1核/20gSSD/1T流量

美国知名管理型主机公司,2006年运作至今,虚拟主机、VPS、云服务器、独立服务器等业务全部采用“managed”,也就是人工参与度高,很多事情都可以人工帮你处理,不过一直以来价格也贵。也不知道knownhost什么时候开始运作无管理型业务的,估计是为了扩展市场吧,反正是出来较长时间了。闲来无事,那就给大家介绍下“unmanaged VPS”,也就是无管理型VPS,低至5美元/月,基于KVM虚拟,...

Hosteons:洛杉矶/纽约/达拉斯免费升级10Gbps端口,KVM年付21美元起

今年1月的时候Hosteons开始提供1Gbps端口KVM架构VPS,目前商家在LET发布消息,到本月30日之前,用户下单洛杉矶/纽约/达拉斯三个地区机房KVM主机可以从1Gbps免费升级到10Gbps端口,最低年付仅21美元起。Hosteons是一家成立于2018年的国外VPS主机商,主要提供VPS、Hybrid Dedicated Servers及独立服务器租用等,提供IPv4+IPv6,支持...

iHostART:罗马尼亚VPS/无视DMCA抗投诉vps;2核4G/40GB SSD/100M端口月流量2TB,€20/年

ihostart怎么样?ihostart是一家国外新商家,主要提供cPanel主机、KVM VPS、大硬盘存储VPS和独立服务器,数据中心位于罗马尼亚,官方明确说明无视DMCA,对版权内容较为宽松。有需要的可以关注一下。目前,iHostART给出了罗马尼亚vps的优惠信息,罗马尼亚VPS无视DMCA、抗投诉vps/2核4G内存/40GB SSD/100M端口月流量2TB,€20/年。点击直达:ih...

oscommerce为你推荐
phpweb破解宽带无线网是WPAPSK会被破解吗flashwind谁能教我怎么在360里下载个flashwind?360邮箱免费注册360账号-电子邮箱怎么填写?ipad代理如何贷款买IPAD什么是通配符dir是什么400电话查询能查出400电话是什么地区的吗kingcmsKingCMS 开始该则呢么设置呢?社区动力你为什么想当一名社区工作者长沙电话号码升位长沙的座机什么时候变成8位的,急!在线等答案,那如果之前的7位数是不是都会变啊?变成什么样了呢?搜索引擎教程如何利用搜索引擎获取有效的信息?
新网域名管理 google镜像 主机 安云加速器 163网 双12活动 godaddy优惠券 网通ip 193邮箱 200g硬盘 tna官网 申请免费空间和域名 空间登入 starry 韩国代理ip 中国linux 贵阳电信 万网空间 摩尔庄园注册 网页加速 更多