Error Analysis of Named Entity Recognition in BCCWJ

Masaaki Ichihara 1, Kanako Komiya 1, Tomoya Iwakura 2, Maiko Yamazaki 3
1 Ibaraki University, 3 Fujitsu Laboratories Ltd., 2 Tokyo Institute of Technology
{11t4004s@hcs, kkomiya@mx}.ibaraki.ac.jp, iwakura.tomoya@jp.fujitsu.com, yamazaki@lr.pi.titech.ac.jp

1 Introduction

Named Entity Recognition is a process by which named entities (NEs) such as the names of persons, locations, and artifacts are extracted.
Most named entity recognition techniques have been studied on news articles; however, their performance on texts from different domains such as blogs, books, and magazines has not yet been evaluated well.
This paper reports an error analysis of KNP on six domains to reveal causes of errors for further improvement of NE recognition (footnote 1).

2 Error Analysis of KNP on BCCWJ

The Japanese dependency and case structure analyzer KNP (footnote 2) ([2] and [3]) was used as the named entity recognizer. The versions we used were KNP Ver. 4.11 and JUMAN Ver. 7.0. Six genres in the Balanced Corpus of Contemporary Written Japanese (BCCWJ), "Q&A sites", "white papers", "blogs", "books", "magazines", and "newspaper articles", were used as the target corpora. One hundred thirty-six texts extracted from BCCWJ, which are available as Class A (footnote 3), were used for the experiments. They were manually annotated with nine kinds of NE that were defined by the Information Retrieval and Extraction Exercise (IREX) (footnote 4). These NE types are the names of persons, locations, organizations, artifacts, dates, times, moneys, percents, and optional (footnote 5). The annotation was done by five members of the NE team of the Project Next NLP and checked by four members of the team.

Footnotes
1 This paper is an English version of (Ichihara et al., 2015) [1] with additional information and some corrections.
2 http://nlp.ist.i.kyoto-u.ac.jp/EN/index.php?KNP
3 http://plata.ar.media.kyoto-u.ac.jp/mori/research/NLR/JDC/ClassA-1.list
4 http://nlp.cs.nyu.edu/irex/index-e.html
5 KNP does not extract optional tags.

We compared KNP outputs with the manually annotated texts and analyzed the errors. Table 1 shows the performance of KNP. The equations of recall, precision, accuracy, and F-measure are as follows. "Correct", the numerator of recall, precision, and accuracy, is the number of correct answers of KNP. "Annotated", the denominator of recall, denotes the number of NEs that were manually annotated. "KNP outputs", the denominator of precision, denotes the number of NEs that KNP output. The denominator of accuracy is the logical sum (OR) of "Annotated" and "KNP outputs". The denominators of recall, precision, and accuracy vary because KNP sometimes cannot extract some NEs and sometimes extracts wrong information. Also, an NE that the system output sometimes consists of multiple annotated NEs, as illustrated by the example in Figure 1, and vice versa. Table 1 shows that the recall is lower than the precision.
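
The four definitions above can be checked with a few lines of Python. This is a minimal sketch: the function name `ner_metrics` and its parameter names are illustrative choices, and the counts passed in are the "Correct" and "Denominator" values reported in Table 1.

```python
def ner_metrics(correct: int, annotated: int, knp_outputs: int, union: int) -> dict:
    """Compute the four scores from raw counts.

    correct     -- number of correct answers of KNP
    annotated   -- number of manually annotated NEs (denominator of recall)
    knp_outputs -- number of NEs that KNP output (denominator of precision)
    union       -- logical sum (OR) of annotated NEs and KNP outputs
    """
    recall = correct / annotated
    precision = correct / knp_outputs
    accuracy = correct / union
    f_measure = 2 * recall * precision / (recall + precision)
    return {
        "recall": recall,
        "precision": precision,
        "accuracy": accuracy,
        "f_measure": f_measure,
    }


# Counts taken from Table 1.
scores = ner_metrics(correct=1632, annotated=2641, knp_outputs=2182, union=2816)
for name, value in scores.items():
    print(f"{name}: {value:.2%}")
```

Because the union count is at least as large as either of the other two denominators, accuracy can never exceed recall or precision.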
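
KNP:        PERSON/PERSON
Annotation: LOCATION/LOCATION LOCATION/LOCATION
Figure 1: An example of an NE that KNP output which includes multiple annotated NEs

$$\mathrm{Recall} = \frac{\mathrm{Correct}}{\mathrm{Annotated}} \tag{1}$$

$$\mathrm{Precision} = \frac{\mathrm{Correct}}{\mathrm{KNP\ outputs}} \tag{2}$$

$$\mathrm{Accuracy} = \frac{\mathrm{Correct}}{\mathrm{Annotated} \cup \mathrm{KNP\ outputs}} \tag{3}$$

$$\mathrm{F\text{-}measure} = \frac{2 \cdot \mathrm{Recall} \cdot \mathrm{Precision}}{\mathrm{Recall} + \mathrm{Precision}} \tag{4}$$

Table 1: Performances of KNP

Performance   Rate     Correct   Denominator
Recall        61.79%   1632      2641
Precision     74.79%   1632      2182
Accuracy      57.95%   1632      2816
F-measure     67.68

The errors were classified into the following five types.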
Examples are shown with each description.

No extraction: The error where KNP did not extract tokens as an NE though they were annotated.
  KNP:
  Annotation: ARTIFACT/ARTIFACT

No annotation: The error where KNP extracted tokens as an NE though they were not annotated.
  KNP:        PERSON/PERSON
  Annotation:

Wrong range: The error where KNP extracted tokens as an NE and only the range was wrong. (The extracted tokens were partially annotated or they were a part of the annotated tokens.)
  KNP 1:        PERSON/PERSON
  Annotation 1: PERSON/PERSON
  KNP 2:        ORGANIZATION/ORGANIZATION
  Annotation 2: ORGANIZATION/ORGANIZATION

Wrong tag: The error where KNP extracted tokens as an NE and only the tag type was wrong.
  KNP:        PERSON/PERSON
  Annotation: LOCATION/LOCATION

Wrong range and tag: The error where KNP extracted tokens as an NE and both the range and the tag type were wrong.
  KNP:        PERSON/PERSON
  Annotation: LOCATION/LOCATION

Table 2: Summary of errors

Error type            Num    Rate
No extraction         619    52.28%
No annotation         159    13.43%
Wrong range           162    13.68%
Wrong tag             127    10.73%
Wrong range and tag   117     9.88%
All errors           1184   100.00%

Table 2 shows a summary of errors.
These errors were counted over the logical sum (OR) of annotated NEs and KNP outputs. The most frequent error was "No extraction", which accounted for more than half of the total errors. The second most frequent error was "Wrong range", and in most of these cases the extracted tokens were a part of the annotated tokens.

Table 3 shows a summary of errors by types of NEs. These errors were also counted over the logical sum (OR) of annotated NEs and KNP outputs. "Correct" and "Error" are the numbers of the correct answers and the errors of KNP. "Total" is the sum of "Correct" and "Error". "No extraction" and "Errors with extraction" in the table are the numbers of "No extraction" errors and of errors other than "No extraction", respectively. "No extraction rate" is the ratio of "No extraction" in "Error".

Table 3 shows that the no extraction rates of "ARTIFACT", "PERCENT", "TIME", and "OPTIONAL" are especially high. At the same time, there are only a small number of NEs of "PERCENT" and "TIME" in the corpora. Therefore, we can see that "ARTIFACT" is the main reason why the no extraction rate of all tags is high. The no extraction rate of "OPTIONAL" is 100% because KNP does not extract OPTIONALs, and this is another reason.

Table 3 also shows that most of "TIME", "MONEY", and "PERCENT" were correctly tagged by KNP if they were tagged. Most of the errors made when NEs were extracted are those of "ORGANIZATION", "PERSON", and "LOCATION". The sum of the errors of "ARTIFACT" and "DATE" is less than 30% of all errors made when NEs were extracted.
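
The ratios discussed above follow directly from the counts in Table 3. The sketch below recomputes them; the dictionary `TABLE3` copies the "Correct", "No extraction", and "Errors with extraction" columns from Table 3, and the helper names are illustrative.

```python
# (correct, no extraction, errors with extraction) per tag, copied from Table 3.
TABLE3 = {
    "ARTIFACT": (90, 192, 67),
    "DATE": (343, 62, 83),
    "LOCATION": (409, 72, 154),
    "MONEY": (88, 2, 2),
    "ORGANIZATION": (236, 77, 123),
    "PERCENT": (79, 10, 2),
    "PERSON": (364, 88, 134),
    "TIME": (23, 9, 0),
    "OPTIONAL": (0, 107, 0),
}


def no_extraction_rate(tag: str) -> float:
    """Ratio of 'No extraction' among all errors of the given tag."""
    _, no_ext, with_ext = TABLE3[tag]
    return no_ext / (no_ext + with_ext)


def share_of_extraction_errors(tags) -> float:
    """Share of the given tags among all errors made when an NE was extracted."""
    total = sum(with_ext for _, _, with_ext in TABLE3.values())
    return sum(TABLE3[tag][2] for tag in tags) / total


print(f"ARTIFACT no extraction rate: {no_extraction_rate('ARTIFACT'):.2%}")
print(f"ARTIFACT + DATE share: {share_of_extraction_errors(['ARTIFACT', 'DATE']):.2%}")
print(
    "ORGANIZATION + PERSON + LOCATION share: "
    f"{share_of_extraction_errors(['ORGANIZATION', 'PERSON', 'LOCATION']):.2%}"
)
```

With these counts, "ARTIFACT" and "DATE" together account for 150 of the 565 errors with extraction, which is consistent with the "less than 30%" statement.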

Table 4 shows the accuracies and the rates of no extraction in "Total" according to the tag type. "Accuracy" is the ratio of the correct answers in "Total", the sum of the correct answers and the errors of KNP, and "No extraction/Total" is the ratio of no extraction in it. These errors were also counted over the logical sum (OR) of annotated NEs and KNP outputs.

Table 4 shows that the accuracy of "ARTIFACT" is particularly low compared with the other tags. The same table shows that its ratio of no extraction in "Total" is also high.
Therefore, we could see that "No extraction" of "ARTIFACT" is the biggest cause of the errors of KNP and the main reason for the low recall.

Table 3: Summary of errors by types of NEs

Tag            Correct   Error   Total   No extraction   Errors with extraction   No extraction rate
ARTIFACT            90     259     349             192                       67               74.13%
DATE               343     145     488              62                       83               42.76%
LOCATION           409     226     635              72                      154               31.86%
MONEY               88       4      92               2                        2               50.00%
ORGANIZATION       236     200     436              77                      123               38.50%
PERCENT             79      12      91              10                        2               83.33%
PERSON             364     222     586              88                      134               39.64%
TIME                23       9      32               9                        0              100.00%
OPTIONAL             0     107     107             107                        0              100.00%
All Tags          1632    1184    2816             619                      565               52.28%

Table 4: Accuracies and rates of no extraction in "Total" according to the tag type

Tag            Accuracy   No extraction/Total
ARTIFACT         25.79%                55.01%
DATE             70.29%                12.70%
LOCATION         64.41%                11.34%
MONEY            95.65%                 2.17%
ORGANIZATION     54.13%                17.66%
PERCENT          86.81%                10.99%
PERSON           62.12%                15.02%
TIME             71.88%                28.13%
OPTIONAL          0.00%               100.00%
All Tags         57.95%                21.98%

3 Error Analysis of "No Extraction"

The target corpora we used consisted of six genres, "Q&A sites", "white papers", "blogs", "books", "magazines", and "newspaper articles", in BCCWJ.
Table 5 shows a summary of errors by genres of texts. Except for "No extraction", these errors were counted on the NEs that KNP output. "Correct" and "Error" are the numbers of the correct answers and the errors of KNP. "Total" is the sum of "Correct" and "Error". "No extraction" and "Errors with extraction" in the table are the numbers of "No extraction" errors and of errors other than "No extraction", respectively. "No extraction rate" is the ratio of "No extraction" in "Error". "Docs" is the number of documents of the genre.

Table 6: Accuracies and rates of no extraction in "Total" according to the genre

Genre         Accuracy   No extraction/Total
Q&A             40.00%                44.21%
White paper     58.73%                20.63%
Blog            50.74%                27.89%
Book            50.35%                28.07%
Magazine        53.45%                14.66%
Newspaper       72.27%                15.49%
All             58.26%                22.10%

The total number of errors (1169) and the total number of errors with extraction (550) are different from those in Tables 2 and 3 (1184 and 565). This is because some NEs that KNP output include multiple annotated NEs.
In addition, the number of words varies according to the genre. We think this is a reason why the total number of NEs was not proportional to the number of documents. Table 5 shows that the genre whose no extraction rate was the highest was "Q&A sites" and the genre with the lowest rate was "magazines".

Table 6 shows the accuracies and the rates of no extraction in "Total" according to the genre. "Accuracy" is the ratio of the correct answers in "Total", the sum of the correct answers and the errors of KNP, and "No extraction/Total" is the ratio of no extraction in it. Except for "No extraction", these errors were counted on the NEs that KNP output. The "Accuracy" of "All" (58.26%) is different from "Recall" in Table 1 (61.79%) because the number of NEs that KNP output was different from the number of NEs that were annotated by humans. Table 6 shows that "newspaper articles" is the genre whose accuracy is the highest. We think this is because KNP was trained with newspaper articles of MAINICHI SHIMBUN. Table 6 also shows that the genre with the lowest accuracy was "Q&A sites". We think this is because the writing style of Q&A sites was the most different from that of newspaper articles.
The same table shows that the genre whose no extraction rate was the highest was "Q&A sites" and the genre with the lowest rate was "magazines".

Table 5: Summary of errors by genres of texts

Genre         Correct   Error   Total   No extraction   Errors with extraction   No extraction rate   Docs
Q&A                76     114     190              84                       30               73.68%     74
White paper       427     300     727             150                      150               50.00%      8
Blog              171     166     337              94                       72               56.63%     34
Book              217     214     431             121                       93               56.54%      5
Magazine          186     162     348              51                      111               31.48%      2
Newspaper         555     213     768             119                       94               55.87%     13
All Genres       1632    1169    2801             619                      550               52.95%    136
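
The per-genre figures in Table 6 can be derived from the counts in Table 5, which also makes the genre ranking easy to reproduce. This is a minimal sketch; `TABLE5` copies the "Correct", "No extraction", and "Errors with extraction" columns from Table 5, and `genre_rates` is an illustrative helper name.

```python
# (correct, no extraction, errors with extraction) per genre, copied from Table 5.
TABLE5 = {
    "Q&A": (76, 84, 30),
    "White paper": (427, 150, 150),
    "Blog": (171, 94, 72),
    "Book": (217, 121, 93),
    "Magazine": (186, 51, 111),
    "Newspaper": (555, 119, 94),
}


def genre_rates(correct: int, no_ext: int, with_ext: int):
    """Return (accuracy, no extraction / total) for one genre."""
    total = correct + no_ext + with_ext
    return correct / total, no_ext / total


rates = {genre: genre_rates(*counts) for genre, counts in TABLE5.items()}

# Rank genres from highest to lowest accuracy.
ranking = sorted(rates, key=lambda genre: rates[genre][0], reverse=True)
for genre in ranking:
    accuracy, no_ext_ratio = rates[genre]
    print(f"{genre:12s} accuracy={accuracy:.2%} no extraction/total={no_ext_ratio:.2%}")
```

Sorting by accuracy places "Newspaper" first and "Q&A" last, matching the discussion of Table 6.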

3.1 No Extraction of Q&A Sites

"Q&A sites" was the genre whose accuracy was the lowest. Examples of no extraction errors in "Q&A sites" are as follows.

i Many names of products, characters, and medicines were not extracted.
  (Sakura Wars) (Super Nintendo Entertainment System) (ActRaiser) 4 (Resident Evil 4) (Kamen Rider) (Ultraman) (Gundam) (Minostacin) (Aspirin)
ii Abbreviations were not extracted. Formal names are noted in brackets.
  (Mario World) (Super Mario World), GC ((Nintendo GameCube)), JNB ((Japan Net Bank)), LA ((Los Angeles))
iii Unusual date expressions were not extracted.
  (90/11/21)
iv Hiragana expressions were sometimes wrongly parsed.
  "(Satoshi)" in "(CHIEBUKURER Satoshi)" should be the name of a person, but it was wrongly parsed as "(Satoru)": a verb.
v NEs written in alphabetic characters and numbers were not extracted.
  "(JR East)" were extracted.

3.2 No Extraction of Newspaper Articles

"Newspaper articles" was the genre whose accuracy was the highest. Examples of no extraction errors in "newspaper articles" are as follows.

i Some NEs with specific prefixes and suffixes were not extracted.
  (half**, ex. half time)
  (**region, ex. (capital region) (three major metropolitan areas))
  (**area)
  (**point)
  (same**, ex. (same** year) (same day) (same year autumn))
ii OPTIONALs were not extracted because KNP does not extract optional tags.
iii Unusual English expressions in Japanese sentences were not extracted.
  KOERA JAPAN
iv Brackets sometimes cause errors.
  (Phoenix (Arizona, US))
v NEs that consist of general nouns were not extracted. This could be the reason why the names of products and characters were not extracted.
  (Hirune, a nap) (Zaurus) (Family Mart) (Sharp) (The Renaissance)
  "Softbank" sometimes could be extracted and sometimes could not. It was parsed as nominative case when it was extracted and as "in clause" when it was not.

4 Discussion

According to the examples described above, we think that the lack of knowledge in the dictionary and the errors of the parser are the main reasons for the errors of the named entity recognition. In particular, the names of artifacts, including the names of products or characters, are often newly coined words. These NEs are not in the dictionary that KNP uses; therefore, whether they are NEs has to be judged from the features of the surrounding patterns and the syntactic features. As a result, correct parsing is important for NEs that cannot use dictionary information. However, a casual writing style like that of Q&A sites causes errors in morphological analysis and parsing. We think that if the sentences in these informal writing styles could be correctly analyzed and parsed, the errors would decrease. Training on texts with informal writing styles could be a solution to this problem. In addition, most of the NEs that were not extracted by KNP were found in Wikipedia or on other Web sites. This information could also help improve the recall.
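
As one way to picture how entries found in Wikipedia or on other Web sites might be used, the sketch below scans a text for the longest matching entries of a name list. This is an assumption-laden illustration, not a method proposed or evaluated in this paper: the function `find_gazetteer_matches`, the `gazetteer` set, and the English example sentence are all hypothetical. Matching is done character by character because Japanese text has no spaces between words.

```python
from typing import List, Set, Tuple


def find_gazetteer_matches(text: str, gazetteer: Set[str]) -> List[Tuple[int, int, str]]:
    """Return (start, end, entry) for non-overlapping longest matches, left to right."""
    max_len = max((len(entry) for entry in gazetteer), default=0)
    matches = []
    i = 0
    while i < len(text):
        # Try the longest candidate first so that longer names win over substrings.
        for length in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + length]
            if candidate in gazetteer:
                matches.append((i, i + length, candidate))
                i += length
                break
        else:
            i += 1  # no entry starts at this position
    return matches


# Hypothetical name list, e.g. collected from article titles.
gazetteer = {"Super Mario World", "Mario World", "Gundam"}
text = "I played Super Mario World and watched Gundam"
matches = find_gazetteer_matches(text, gazetteer)
print([entry for _, _, entry in matches])
```

A lookup like this only proposes candidate spans. Deciding the tag type and handling general nouns that are also names (such as the examples in Section 3.2) would still need contextual features.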

5 Conclusion

This paper reported an error analysis of the named entity recognizer KNP on six domains to reveal causes of errors. The texts of BCCWJ were manually annotated and compared with the automatically tagged texts. The analysis revealed that the most frequent error was "No extraction": the case where tokens were not extracted by KNP though they were annotated. It also revealed that "No extraction" of "ARTIFACT" is the biggest cause of low recall and that "Q&A sites" is the genre whose accuracy is the lowest. We focused on the no extraction errors and found that the lack of dictionary information and the various writing styles cause these errors.

Acknowledgements

This work was partially supported by JSPS KAKENHI Grant Number 24700138. We would like to thank Dr. Ryohei Sasano, who provided us with helpful information about KNP, and the team members of the NE team of Project Next NLP.

References

[1] Masaaki Ichihara, Maiko Yamazaki, and Kanako Komiya. Error analysis of named entity extraction in BCCWJ. 7, p. to appear, 2015.
[2] Ryohei Sasano and Sadao Kurohashi. Japanese named entity recognition using non-local information (in Japanese). IPSJ Journal, Vol. 49, No. 11, pp. 3765-3776, 2008.
[3] KNP. 19, pp. 110-113, 2013.