Error Analysis of Named Entity Recognition in BCCWJ

Masaaki Ichihara^1, Kanako Komiya^1, Tomoya Iwakura^2, Maiko Yamazaki^3
^1 Ibaraki University, ^3 Fujitsu Laboratories Ltd., ^2 Tokyo Institute of Technology
{11t4004s@hcs, kkomiya@mx}.ibaraki.ac.jp, iwakura.tomoya@jp.fujitsu.com, yamazaki@lr.pi.titech.ac.jp

1 Introduction

Named Entity Recognition is a process by which named entities (NEs) such as the names of persons, locations, and artifacts are extracted.
Most named entity recognition techniques have been studied on news articles; however, their performance on texts from different domains, such as blogs, books, and magazines, has not yet been evaluated well.
This paper reports an error analysis of KNP on six domains to reveal the causes of errors for further improvement of NE recognition.^1

2 Error Analysis of KNP on BCCWJ

The Japanese dependency and case structure analyzer KNP^2 ([2] and [3]) was used as the named entity recognizer.
The versions we used were KNP Ver. 4.11 and JUMAN Ver. 7.0.
The six genres, "Q&A sites", "white papers", "blogs", "books", "magazines", and "newspaper articles", in the Balanced Corpus of Contemporary Written Japanese (BCCWJ) were used as the target corpora.
One hundred thirty-six texts extracted from BCCWJ, which are available as ClassA^3, were used for the experiments.
They were manually annotated with nine kinds of NE that were defined by the Information Retrieval and Extraction Exercise (IREX)^4.
These NE types are the names of organizations, persons, locations, artifacts, dates, times, moneys, percents, and optional^5.
The annotation was done by five members of the NE team of the Project Next NLP, and checked by four members of it.

^1 This paper is an English version of (Ichihara et al., 2015) [1] with additional information and some corrections.
^2 http://nlp.ist.i.kyoto-u.ac.jp/EN/index.php?KNP
^3 http://plata.ar.media.kyoto-u.ac.jp/mori/research/NLR/JDC/ClassA-1.list
^4 http://nlp.cs.nyu.edu/irex/index-e.html
^5 KNP does not extract optional tags.

We compared KNP outputs with the manually annotated texts and analyzed errors.
Table 1 shows the performances of KNP.
The equations of recall, precision, accuracy, and F-measure are as follows.
"Correct", the numerator of recall, precision, and accuracy, is the number of the correct answers of KNP.
"Annotated", the denominator of recall, denotes the number of the NEs that were manually annotated.
"KNP outputs", the denominator of precision, denotes the number of the NEs that KNP output.
The denominator of accuracy is the logical sum (OR) of "Annotated" and "KNP outputs".
The denominators of recall, precision, and accuracy vary because KNP sometimes cannot extract some NEs and sometimes extracts wrong information.
Also, an NE that the system output sometimes consists of multiple annotated NEs, as illustrated by an example in Figure 1, and vice versa.
Table 1 shows that the recall is lower than the precision.
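
As a quick check of how the four measures relate to the counts, the following minimal sketch recomputes them from the numbers reported in Table 1. The function name `ner_scores` and its parameter names are illustrative assumptions and are not part of KNP or of the original evaluation scripts; `union` stands for the size of the logical sum (OR) of "Annotated" and "KNP outputs".

```python
def ner_scores(correct: int, annotated: int, knp_outputs: int, union: int) -> dict:
    """Recompute recall, precision, accuracy, and F-measure from raw counts.

    Hypothetical helper for illustration only; the argument names mirror
    the terms "Correct", "Annotated", and "KNP outputs" used in the text.
    """
    recall = correct / annotated
    precision = correct / knp_outputs
    accuracy = correct / union
    f_measure = 2 * recall * precision / (recall + precision)
    return {
        "recall": recall,
        "precision": precision,
        "accuracy": accuracy,
        "f_measure": f_measure,
    }


# Counts taken from Table 1.
scores = ner_scores(correct=1632, annotated=2641, knp_outputs=2182, union=2816)
for name, value in scores.items():
    print(f"{name}: {value:.2%}")
```

The three denominators differ (2641, 2182, and 2816), which is why recall, precision, and accuracy cannot be derived from one another without the counts.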
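
KNP:        PERSON/PERSON
Annotation: LOCATION/LOCATION LOCATION/LOCATION

Figure 1: An example in which an NE that KNP output includes multiple annotated NEs

\[ \text{Recall} = \frac{\text{Correct}}{\text{Annotated}} \tag{1} \]
\[ \text{Precision} = \frac{\text{Correct}}{\text{KNP outputs}} \tag{2} \]
\[ \text{Accuracy} = \frac{\text{Correct}}{\text{Annotated} \cup \text{KNP outputs}} \tag{3} \]
\[ \text{F-measure} = \frac{2 \cdot \text{Recall} \cdot \text{Precision}}{\text{Recall} + \text{Precision}} \tag{4} \]

Table 1: Performances of KNP

Performance   Rate     Correct   Denominator
Recall        61.79%   1632      2641
Precision     74.79%   1632      2182
Accuracy      57.95%   1632      2816
F-measure     67.68

The errors were classified into the following five types.
Examples are shown with descriptions.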

No extraction: The error where KNP did not extract tokens as an NE though they were annotated.
    KNP:
    Annotation: ARTIFACT/ARTIFACT

No annotation: The error where KNP extracted tokens as an NE though they were not annotated.
    KNP:        PERSON/PERSON
    Annotation:

Wrong range: The error where KNP extracted tokens as an NE and only the range was wrong. (The extracted tokens were partially annotated or they were part of the annotated tokens.)
    KNP 1:        PERSON/PERSON
    Annotation 1: PERSON/PERSON
    KNP 2:        ORGANIZATION/ORGANIZATION
    Annotation 2: ORGANIZATION/ORGANIZATION

Wrong tag: The error where KNP extracted tokens as an NE and only the tag type was wrong.
    KNP:        PERSON/PERSON
    Annotation: LOCATION/LOCATION

Wrong range and tag: The error where KNP extracted tokens as an NE and both the range and the tag type were wrong.
    KNP:        PERSON/PERSON
    Annotation: LOCATION/LOCATION

Table 2: Summary of errors

Error type            Num    Rate
No extraction         619    52.28%
No annotation         159    13.43%
Wrong range           162    13.68%
Wrong tag             127    10.73%
Wrong range and tag   117    9.88%
All errors            1184   100.00%

Table 2 shows a summary of errors.
These errors were counted by the logical sum (OR) of annotated NEs and KNP outputs.
The most frequent error was "No extraction", and it accounted for more than half of the total errors.
The second most frequent error was "Wrong range", and most of these were errors where the extracted tokens were part of the annotated tokens.
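
The five error types can be read as a decision over two questions: whether a counterpart exists on each side, and whether the range and the tag agree. The sketch below is one possible encoding of that decision, not the procedure actually used for the analysis. The `Span` type, the `classify` function, and the assumption that each KNP output has already been paired with at most one annotated NE are all hypothetical; the paper does not describe how outputs and annotations were aligned.

```python
from typing import NamedTuple, Optional


class Span(NamedTuple):
    """A tagged token range (hypothetical representation)."""
    start: int  # index of the first token, inclusive
    end: int    # index after the last token, exclusive
    tag: str    # e.g. "PERSON", "LOCATION", "ARTIFACT"


def classify(knp: Optional[Span], annotated: Optional[Span]) -> str:
    """Classify one already-aligned pair of a KNP output and an annotated NE.

    Pass None for the missing side when no counterpart exists.
    """
    if knp is None and annotated is None:
        raise ValueError("at least one side must be present")
    if knp is None:
        return "No extraction"
    if annotated is None:
        return "No annotation"
    same_range = (knp.start, knp.end) == (annotated.start, annotated.end)
    same_tag = knp.tag == annotated.tag
    if same_range and same_tag:
        return "Correct"
    if same_tag:
        return "Wrong range"
    if same_range:
        return "Wrong tag"
    return "Wrong range and tag"


print(classify(None, Span(0, 3, "ARTIFACT")))
print(classify(Span(0, 2, "PERSON"), Span(0, 3, "PERSON")))
print(classify(Span(0, 3, "PERSON"), Span(0, 3, "LOCATION")))
```

Under this encoding, a case like Figure 1, where one KNP output covers several annotated NEs, needs an explicit pairing rule before `classify` can be applied, which the sketch deliberately leaves open.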

Table 3 shows a summary of errors by types of NEs.
These errors were also counted by the logical sum (OR) of annotated NEs and KNP outputs.
"Correct" and "Error" are the numbers of the correct answers and the errors of KNP.
"Total" is the sum of "Correct" and "Error".
"No extraction" and "Errors with extraction" in the table mean the numbers of "No extraction" and the errors other than "No extraction", respectively.
"No extraction rate" is the ratio of "No extraction" in "Error".

Table 3 shows that the no extraction rates of "ARTIFACT", "PERCENT", "TIME", and "OPTIONAL" are especially high.
At the same time, there are only a small number of NEs of "PERCENT" and "TIME" in the corpora.
Therefore, we can see that "ARTIFACT" is the main reason why the no extraction rate of all tags is high.
The no extraction rate of "OPTIONAL" is 100% because KNP does not extract OPTIONALs, and this is another reason.
Table 3 also shows that most of "TIME", "MONEY", and "PERCENT" were correctly tagged by KNP if they were tagged.
Most of the errors when they were extracted are those of "ORGANIZATION", "PERSON", and "LOCATION".
The sum of the errors of "ARTIFACT" and "DATE" is less than 30% of all errors when they were extracted.

Table 4 shows the accuracies and the rates of no extraction in "Total" according to the tag type.
"Accuracy" is the ratio of the correct answers in "Total", the sum of correct answers and errors of KNP, and "No extraction/Total" is the ratio of no extraction in it.
These errors were also counted by the logical sum (OR) of annotated NEs and KNP outputs.
Table 4 shows that the accuracy of "ARTIFACT" is particularly low compared with the other tags.
The same table shows that the ratio of no extraction in "Total" is also high.
Therefore, we can see that "No extraction" of "ARTIFACT" is the biggest cause of the errors of KNP and the main reason for the low recall.

Table 3: Summary of errors by types of NEs

Tag            Correct   Error   Total   No extraction   Errors with extraction   No extraction rate
ARTIFACT       90        259     349     192             67                       74.13%
DATE           343       145     488     62              83                       42.76%
LOCATION       409       226     635     72              154                      31.86%
MONEY          88        4       92      2               2                        50.00%
ORGANIZATION   236       200     436     77              123                      38.50%
PERCENT        79        12      91      10              2                        83.33%
PERSON         364       222     586     88              134                      39.64%
TIME           23        9       32      9               0                        100.00%
OPTIONAL       0         107     107     107             0                        100.00%
All Tags       1632      1184    2816    619             565                      52.28%

Table 4: Accuracies and rates of no extraction in "Total" according to the tag type

Tag            Accuracy   No extraction/Total
ARTIFACT       25.79%     55.01%
DATE           70.29%     12.70%
LOCATION       64.41%     11.34%
MONEY          95.65%     2.17%
ORGANIZATION   54.13%     17.66%
PERCENT        86.81%     10.99%
PERSON         62.12%     15.02%
TIME           71.88%     28.13%
OPTIONAL       0.00%      100.00%
All Tags       57.95%     21.98%
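
The ratios in Tables 3 and 4 all follow from three counts per tag: the correct answers, the "No extraction" errors, and the errors with extraction. The following sketch recomputes them from the Table 3 counts, which may help when checking the figures or repeating the analysis on another corpus. The names `TABLE3` and `derive` are illustrative assumptions, not part of any existing tool.

```python
# Per-tag counts from Table 3: (correct, no_extraction, errors_with_extraction).
TABLE3 = {
    "ARTIFACT": (90, 192, 67),
    "DATE": (343, 62, 83),
    "LOCATION": (409, 72, 154),
    "MONEY": (88, 2, 2),
    "ORGANIZATION": (236, 77, 123),
    "PERCENT": (79, 10, 2),
    "PERSON": (364, 88, 134),
    "TIME": (23, 9, 0),
    "OPTIONAL": (0, 107, 0),
}


def derive(correct: int, no_extraction: int, errors_with_extraction: int) -> dict:
    """Derive the ratios of Tables 3 and 4 from raw counts (hypothetical helper)."""
    error = no_extraction + errors_with_extraction
    total = correct + error
    return {
        "error": error,
        "total": total,
        "accuracy": correct / total,
        "no_extraction_per_total": no_extraction / total,
        # "No extraction rate" is the share of "No extraction" within "Error".
        "no_extraction_rate": no_extraction / error if error else 0.0,
    }


derived = {tag: derive(*counts) for tag, counts in TABLE3.items()}
all_tags = derive(*(sum(column) for column in zip(*TABLE3.values())))

for tag, row in derived.items():
    print(f"{tag:<13} accuracy={row['accuracy']:.2%} "
          f"no_extraction/total={row['no_extraction_per_total']:.2%}")
```

Summing the columns reproduces the "All Tags" row (1632 correct answers, 619 "No extraction" errors, and 565 errors with extraction), and "ARTIFACT" alone contributes 192 of the 619 "No extraction" errors.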

3 Error Analysis of "No Extraction"

The target corpora we used consisted of six genres, "Q&A sites", "white papers", "blogs", "books", "magazines", and "newspaper articles", in BCCWJ.
Table 5 shows a summary of errors by genres of texts.
These errors except "No extraction" are those that KNP output.
"Correct" and "Error" are the numbers of the correct answers and the errors of KNP.
"Total" is the sum of "Correct" and "Error".
"No extraction" and "Errors with extraction" in the table mean the numbers of "No extraction" and the errors other than "No extraction", respectively.
"No extraction rate" is the ratio of "No extraction" in "Error".
"Docs" is the number of documents of the genre.
The total number of errors (1169) and the total number of errors with extraction (550) are different from those in Tables 2 and 3 (1184 and 565).
This is because some NEs that KNP output include multiple annotated NEs.
In addition, the number of words varies according to the genre.
We think this is a reason why the total number of the NEs was not proportional to the number of the documents.
Table 5 shows that the genre whose no extraction rate was the highest was "Q&A sites" and the genre with the lowest rate was "magazines".

Table 5: Summary of errors by genres of texts

Genre         Correct   Error   Total   No extraction   Errors with extraction   No extraction rate   Docs
Q&A           76        114     190     84              30                       73.68%               74
White paper   427       300     727     150             150                      50.00%               8
Blog          171       166     337     94              72                       56.63%               34
Book          217       214     431     121             93                       56.54%               5
Magazine      186       162     348     51              111                      31.48%               2
Newspaper     555       213     768     119             94                       55.87%               13
All Genres    1632      1169    2801    619             550                      52.95%               136

Table 6 shows the accuracies and the rates of no extraction in "Total" according to the genre.
"Accuracy" is the ratio of the correct answers in "Total", the sum of correct answers and errors of KNP, and "No extraction/Total" is the ratio of no extraction in it.
These errors except "No extraction" are those that KNP output.
"Accuracy" of "All" (58.26%) is different from "Recall" in Table 1 (61.79%) because the number of the NEs KNP output was different from the number of the NEs that were annotated by humans.
Table 6 shows that "newspaper articles" is the genre whose accuracy is the highest.
We think this is because KNP was trained with newspaper articles of MAINICHI SHIMBUN.
Table 6 also shows that the genre with the lowest accuracy was "Q&A sites".
We think this is because the writing style of Q&A sites was the most different from that of newspaper articles.
The same table shows that the genre whose no extraction rate was the highest was "Q&A sites" and the genre with the lowest rate was "magazines".

Table 6: Accuracies and rates of no extraction in "Total" according to the genre

Genre         Accuracy   No extraction/Total
Q&A           40.00%     44.21%
White paper   58.73%     20.63%
Blog          50.74%     27.89%
Book          50.35%     28.07%
Magazine      53.45%     14.66%
Newspaper     72.27%     15.49%
All           58.26%     22.10%

3.1 No Extraction of Q&A Sites

"Q&A sites" was the genre whose accuracy was the lowest.
Examples of no extraction errors in "Q&A sites" are shown below.

i. Many names of products, characters, and medicines were not extracted.
    (Sakura Wars), (Super Nintendo Entertainment System), (ActRaiser), 4 (Resident Evil 4), (Kamen Rider), (Ultraman), (Gundam), (Minostacin), (Aspirin)
ii. Abbreviations were not extracted. Formal names are noted in brackets.
    (Mario World) (Super Mario World), GC ((Nintendo GameCube)), JNB ((Japan Net Bank)), LA ((Los Angeles))
iii. The unusual date expressions were not extracted.
    (90/11/21)
iv. Hiragana expressions were sometimes wrongly parsed.
    "(Satoshi)" in "(CHIEBUKURER Satoshi)" should be the name of a person, but it is wrongly parsed as "(Satoru)": a verb.
v. NEs written in alphabets and numbers were not extracted.
    "(JR East)" were extracted.

3.2 No Extraction of Newspaper Articles

"Newspaper articles" was the genre whose accuracy was the highest.
Examples of no extraction errors in "newspaper articles" are shown below.

i. Some NEs with specific prefixes and suffixes were not extracted.
    (half **, ex. half time), (** region, ex. (capital region) (three major metropolitan areas)), (** area), (** point), (same **, ex. (same ** year) (same day) (same year autumn))
ii. OPTIONALs were not extracted because KNP does not extract optional tags.
iii. The unusual English expressions in Japanese sentences were not extracted.
    KOERAJAPAN
iv. Brackets sometimes cause the errors.
    (Phoenix (Arizona, US))
v. NEs that consist of general nouns were not extracted. This could be the reason why the names of products and characters were not extracted.
    (Hirune, a nap), (Zaurus), (Family Mart), (Sharp), (The Renaissance)
    "Softbank" sometimes could be extracted and sometimes could not. They were parsed as nominative case when they were extracted and as "in clause" when they were not.

4 Discussion

According to the examples described, we think that the lack of knowledge in the dictionary and the errors of the parser are the main reasons for the errors of the named entity recognition.
In particular, the names of artifacts, including the names of products or characters, are often newly coined words.
These NEs are not in the dictionary KNP uses, and therefore whether they are NEs or not has to be judged from the features of the surrounding patterns and the syntactic features.
As a result, correct parsing would be important for the NEs that cannot use dictionary information.
However, a casual writing style like that of Q&A sites causes errors in morphological analysis and parsing.
We think that if the sentences in these informal writing styles could be correctly analyzed and parsed, the errors would decrease.
Training on texts with informal writing styles could be a solution to this problem.
In addition, most of the NEs that were not extracted by KNP were found in Wikipedia or other Web sites.
This information could also help improve the recall.
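
One way to picture how entries harvested from Wikipedia or other Web sites might help recall is a simple longest-match gazetteer lookup over tokens. The sketch below is only an illustration under stated assumptions and is not a method proposed or evaluated in this paper: the function `gazetteer_matches`, the toy gazetteer, and the English tokens (standing in for Japanese morphemes) are all hypothetical, and no KNP or JUMAN API is used.

```python
def gazetteer_matches(tokens, gazetteer, max_len=5):
    """Return (start, end, tag) for the longest gazetteer entries found in tokens.

    `gazetteer` maps a tuple of tokens to an NE tag. Scanning is left to
    right, and at each position the longest matching entry wins.
    """
    matches = []
    i = 0
    while i < len(tokens):
        # Try the longest candidate first, then shorter ones.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            key = tuple(tokens[i:i + n])
            if key in gazetteer:
                matches.append((i, i + n, gazetteer[key]))
                i += n
                break
        else:
            # No entry starts at this position; move on by one token.
            i += 1
    return matches


# Toy gazetteer; in practice the entries might come from Wikipedia titles.
gazetteer = {
    ("Sakura", "Wars"): "ARTIFACT",
    ("Nintendo", "GameCube"): "ARTIFACT",
}
tokens = ["I", "played", "Sakura", "Wars", "on", "the", "Nintendo", "GameCube"]
matches = gazetteer_matches(tokens, gazetteer)
print(matches)
```

A lookup like this ignores context, so entries that are also general nouns would still need the surrounding patterns and syntactic features discussed above to avoid false extractions.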

5 Conclusion

This paper reports an error analysis of the named entity recognizer KNP on six domains to reveal the causes of errors.
The texts of BCCWJ were manually annotated and compared with the automatically tagged texts.
The analysis revealed that the most frequent error was "No extraction": the case where the tokens were not extracted by KNP though they were annotated.
It also revealed that "No extraction" of "ARTIFACT" is the biggest cause of the low recall and that "Q&A sites" is the genre whose accuracy is the lowest.
We focused on the no extraction errors and found that the lack of dictionary information and the variety of writing styles cause these errors.

Acknowledgements

This work was partially supported by JSPS KAKENHI Grant Number 24700138.
We would like to thank Dr. Ryohei Sasano, who provided us with helpful information about KNP, and the members of the NE team of Project Next NLP.

References

[1] Masaaki Ichihara, Maiko Yamazaki, and Kanako Komiya. Error analysis of named entity extraction in bccwj (bccwj). 7, p. to appear, 2015.
[2] Ryohei Sasano and Sadao Kurohashi. Japanese named entity recognition using non-local information (in Japanese). IPSJ Journal, Vol. 49, No. 11, pp. 3765–3776, 2008.
[3] knp. , 19, pp. 110–113, 2013.