relysaintpaulon

saintpaulon  时间:2021-04-16  阅读:()
ConversationalSystems16.
345AutomaticSpeechRecognition(2003)ConversationalSystems*:AdvancesandChallengesIntroductionSpeechUnderstanding–NaturalLanguageUnderstanding–DiscourseResolution–DialogueModelingDevelopmentIssuesRecentProgressFutureChallengesSummary*AKAspokenlanguagesystemsorspokendialoguesystemsSeearticlebyZueandGlass(2000)Lecture#22Session2003ConversationalSystems26.
345AutomaticSpeechRecognition(2003)ThePremise:EverybodywantsInformationEverybodywantsInformationNeednewinterfacesSpeechisIt!
ForNorthAmericaCommerceNetResearchCenter(1999)EvenwhentheyareonthemoveEvenwhentheyareonthemoveTheinterfacemustbeeasytouseTheinterfacemustbeeasytouseDevicesmustbesmallIntroduction||NL||Development||Progress||ChallengesConversationalSystems36.
345AutomaticSpeechRecognition(2003)WhatAreConversationalSystemsSystemsthatcancommunicatewithusersthroughaconversationalparadigm,i.
e.
,theycan:–Understandverbalinput,using*Speechrecognition*Languageunderstanding(incontext)–Verbalizeresponse,using*Languagegeneration*Speechsynthesis–EngageindialoguewithauserduringtheinteractionIntroduction||NL||Development||Progress||ChallengesConversationalSystems46.
345AutomaticSpeechRecognition(2003)HumanComputerInitiativeHumantakescompletecontrolComputeristotallypassiveHumantakescompletecontrolComputeristotallypassiveH:Iwanttovisitmygrandmother.
ComputermaintainstightcontrolHumanishighlyrestrictedComputermaintainstightcontrolHumanishighlyrestrictedC:Pleasesaythedeparturecity.
DefiningtheContextConversationalsystemsdifferinthedegreewithwhichhumanorcomputertakestheinitiativeDirectedDialogueFreeFormDialogueMixedInitiativeDialogueIntroduction||NL||Development||Progress||ChallengesConversationalSystems56.
345AutomaticSpeechRecognition(2003)…….
.
C:Yeah,[um]I'mlookingfortheBufordCinema.
A:OK,andyou'rewantingtoknowwhat'sshowingthereor.
.
.
C:Yes,please.
A:AreyoulookingforaparticularmovieC:[um]What'sshowing.
A:OK,onemoment.
…….
.
A:They'reshowingATrollInCentralPark.
C:No.
A:Frankenstein.
C:WhattimeisthatonA:Seventwentyandninefifty.
C:OK,anyothersdisfluencyinterruption,overlapconfirmationclarificationbackchannelinferenceellipsisco-referenceTheNatureofMixedInitiativeInteractions(AHuman-HumanExample)MediaClipIntroduction||NL||Development||Progress||ChallengesConversationalSystems66.
345AutomaticSpeechRecognition(2003)4812162020+0102030405060%ofTurnsAVE#OFWORDS/TURNAgentClientOver1,000dialoguesinmanydomains(Flammia'98)Somelessonslearned(aboutclients):–Morethan80%ofutterancesare12wordsorless–MostshortutterancesareconfirmationandbackchannelcommunicationsStudyofhuman-humaninteractionscanleadtogoodinsightsinbuildinghuman-machinesystemsIntroduction||NL||Development||Progress||ChallengesConversationalSystems76.
345AutomaticSpeechRecognition(2003)DialogueManagementStrategiesDirecteddialoguescanbeimplementedasadirectedgraphbetweendialoguestates–Connectionsbetweenstatesarepredefined–Userisguidedthroughthegraphbythemachine–DirecteddialogueshavebeensuccessfullydeployedcommerciallyMixed-initiativedialoguesarepossiblewhenstatetransitionsdetermineddynamically–Transitionscanbedetermined,e.
g.
,byE-formvariablevalues–Userhasflexibilitytospecifyconstraintsinanyorder–Systemcan"backoff"toadirecteddialogueifdesired–Mixed-initiativedialoguesmainlyresearchprototypesIntroduction||NL||Development||Progress||ChallengesConversationalSystems86.
345AutomaticSpeechRecognition(2003)ExampleofMIT'sMercuryTravelPlanningSystemNewusercallingintoMercuryflightplanningsystemIllustratedtechnicalissues:–Back-offtodirecteddialoguewhennecessary(e.
g.
,password)–Understandingmid-streamcorrections(e.
g.
,"noWednesday")–Solicitingnecessaryinformationfromuser–Confirmingunderstoodconceptstouser–Summarizingmultipledatabaseresults–Allowingnegotiationwithuser–Articulatingpertinentinformation–Understandingfragmentsincontext(e.
g.
,"4:45")–Understandingrelativedates(e.
g.
,"thefollowingTuesday")–Quantifyingusersatisfaction(e.
g.
,questionnaire)Introduction||NL||Development||Progress||ChallengesConversationalSystems96.
345AutomaticSpeechRecognition(2003)TodayComponentsofaConversationalSystemDISCOURSECONTEXTDISCOURSECONTEXTDIALOGUEMANAGEMENTDIALOGUEMANAGEMENTDATABASEGraphs&TablesLANGUAGEUNDERSTANDINGLANGUAGEUNDERSTANDINGMeaningRepresentationMeaningRepresentationMeaningLANGUAGEGENERATIONLANGUAGEGENERATIONSPEECHSYNTHESISSPEECHSYNTHESISSpeechSentenceSPEECHRECOGNITIONSPEECHRECOGNITIONSpeechWordsIntroduction||NL||Development||Progress||ChallengesConversationalSystems106.
345AutomaticSpeechRecognition(2003)NaturalLanguageProcessingComponentsUnderstanding:–Parseinputqueryintoameaningrepresentation,tobeinterpretedforappropriateactionbyapplicationdomain–SelectbestcandidatefromproposedrecognizerhypothesesDiscourseResolution–InterpreteachqueryincontextofprecedingdialogueDialogueManagement–Plancourseofactionunderbothexpectedandunexpectedconditions;composeresponseframes.
Generation–Paraphraseuserqueriesintosameordifferentlanguage.
–Composewell-formedsentencestospeakthe(sequenceof)responseframespreparedbythedialoguemanager.
Introduction||NL||Development||Progress||ChallengesConversationalSystems116.
345AutomaticSpeechRecognition(2003)InputProcessing:UnderstandingLANGUAGEUNDERSTANDINGSemanticRepresentationSPEECHRECOGNITIONSpeechWaveformSentenceHypothesesClause:DISPLAYTopic:FLIGHTPredicate:FROMTopic:CITYName:"Boston"Predicate:TOTopic:CITYName:"Denver"Clause:DISPLAYTopic:FLIGHTPredicate:FROMTopic:CITYName:"Boston"Predicate:TOTopic:CITYName:"Denver"FLIGHTFLIGHTSDENVERSHOWTOBOSTONFROMMEONANDIntroduction||NL(NLU)||Development||Progress||ChallengesConversationalSystems126.
345AutomaticSpeechRecognition(2003)TypicalStepsinTransformingUserQueryParsing–EstablishessyntacticorganizationandsemanticcontentTranslationtoaSemanticFrame–ProducesmeaningrepresentationidentifyingrelevantconstituentsandtheirrelationshipsIncorporationofdiscoursecontext–Dealswithfragments,pronominalreferences,etc.
Translationtoadatabasequery–ProducesSQLformattedstringfordatabaseretrievalGenerateFrameIncorporateContextProduceDBQueryProduceParseTreeRecognizerHypothesesParseTreeSemanticFrameFrameinContextSQLIntroduction||NL(NLU)||Development||Progress||ChallengesConversationalSystems136.
345AutomaticSpeechRecognition(2003)NaturalLanguageUnderstandingshowmeflightsfrombostontodenverflightdestinationsourcetopicdisplayobjectpredicatefull_parsecommandsentencepredicatecitycitytofromflight_listdestinationsourceflightdisplaySomesyntacticnodescarrysemantictagsforcreatingsemanticframeClause:DISPLAYTopic:FLIGHTPredicate:FROMTopic:CITYName:"Boston"Predicate:TOTopic:CITYName:"Denver"Clause:DISPLAYTopic:FLIGHTPredicate:FROMTopic:CITYName:"Boston"Predicate:TOTopic:CITYName:"Denver"Introduction||NL(NLU)||Development||Progress||ChallengesConversationalSystems146.
345AutomaticSpeechRecognition(2003)ContextFreeRulesforExamplesentence→(display-clausetruth-clause…)display-clause→displaydirect-objectdirect-object→[determiner](flight-eventfare-event…)flight-event→flight[from-place][to-place]from-place→froma-cityto-place→toa-citydisplay→show-meshow-me→[please]show[me]a-city→(bostondallasdenver…)determiner→(athe).
.
.
Contextfree:lefthandsideofruleissinglesymbolbrackets[]:optionalParentheses():alternates.
TerminalwordsinitalicsIntroduction||NL(NLU)||Development||Progress||ChallengesShowmeflightsfromBostontoDenverConversationalSystems156.
345AutomaticSpeechRecognition(2003)WhatMakesParsingHardMustrealizehighcoverageofwell-formedsentenceswithindomainShoulddisallowill-formedsentences,e.
g.
,–theflightthatarrivinginthemorning–whatrestaurantsdoyouknowaboutanybanksAvoidparseambiguity(redundantparses)MaintainefficiencyIntroduction||NL(NLU)||Development||Progress||ChallengesConversationalSystems166.
345AutomaticSpeechRecognition(2003)UnderstandingWordsinContextSubtledifferencesinphrasingcanleadtocompletelydifferentinterpretations–IsthereasixA.
M.
flight–AretheresixA.
A.
flights–Isthereaflightsix–Isthereaflightatsix"six"couldmean:–Atime–Acount–AflightnumberThepossibilityofrecognitionerrorsmakesithardtorelyonfeatureslikethearticle"a"orthepluralityof"flights.
"Yetinsufficientsyntactic/semanticanalysiscanleadtogrossmisinterpretationsIntroduction||NL(NLU)||Development||Progress||ChallengesConversationalSystems176.
345AutomaticSpeechRecognition(2003)MultipleRolesforNaturalLanguageParsinginSpokenLanguageContextUnderstandingConstraintCoverage100%100%100%Introduction||NL(NLU)||Development||Progress||ChallengesConversationalSystems186.
345AutomaticSpeechRecognition(2003)Statisticallanguagemodels(i.
e.
,n-grams)usedforspeechrecognitionareinappropriateforspeechunderstandingapplications,becausetheydon'tprovideameaningrepresentationStatisticallanguagemodels(i.
e.
,n-grams)usedforspeechrecognitionareinappropriateforspeechunderstandingapplications,becausetheydon'tprovideameaningrepresentationTextbasednaturallanguageprocessingsystemsmaynotbewellsuitedforspeechunderstandingapplications,becausetheytypicallyassumethat:–Wordboundariesareknownwithcertainty–Allwordsareknownwithcertainty–Sentencesarewellformed–ConstraintsareunnecessaryTextbasednaturallanguageprocessingsystemsmaynotbewellsuitedforspeechunderstandingapplications,becausetheytypicallyassumethat:–Wordboundariesareknownwithcertainty–Allwordsareknownwithcertainty–Sentencesarewellformed–ConstraintsareunnecessaryContrastingLanguageModelsforSpeechRecognitionandNaturalLanguageUnderstandingIntroduction||NL(NLU)||Development||Progress||ChallengesConversationalSystems196.
345AutomaticSpeechRecognition(2003)SpokenLanguageUnderstandingSpokeninputdifferssignificantlyfromtext–Falsestarts–Filledpauses–Agrammaticalconstructs–RecognitionerrorsWeneedtodesignnaturallanguagecomponentsthatcanbothconstraintherecognizer'ssearchspaceandrespondappropriatelyevenwhentheinputspeechisnotfullyunderstoodIntroduction||NL(NLU)||Development||Progress||ChallengesConversationalSystems206.
345AutomaticSpeechRecognition(2003)ESPRITSPEECHSomeSpeech-RelatedGovernmentProgramsDARPASCARPASURBBN,CMU,LincolnSDC,SRI,.
.
.
HWIM,Harpy,HearsayDARPASLSATT,BBN,CMU,CRIM,MIT,SRI,Unisys,.
.
.
ATIS,Banking,DART,OM,VOYAGER,.
.
.
ESPRITSUNDIALCNET,CSELT,DaimlerBenz,LogicaAirandTrainTravelLE3ARISECSELT,IRIT,KPN,LIMSI,U.
Nijmegen.
.
TrainTravel1970199019802000DARPAWSJ/BND.
C.
ATT,BBN,CMU,CU,IBM,MIT,MITRE,SpeechWorks,SRI,+Affiliates,.
.
.
ComplexTravelESPRITMASKIntroduction||NL(ATIS)||Development||Progress||ChallengesConversationalSystems216.
345AutomaticSpeechRecognition(2003)TheU.
S.
DARPA-SLSProgram(1990-1995)TheCommunityadoptedacommontask(AirTravelInformationService,orATIS)tospurtechnologydevelopmentUserscouldverballyqueryastaticdatabaseforairtravelinformation–11citiesinNorthAmerica(ATIS-2)–Expandedto46citiesin1993(ATIS-3)–MostlyflightsandfaresAllsystemscouldhandlecontinuousspeechfromunknownspeakers(~2,000wordvocabulary)InfrastructurefortechnologydevelopmentandevaluationwasdevelopedFiveannualcommonevaluationstookplaceIntroduction||NL(ATIS)||Development||Progress||ChallengesConversationalSystems226.
345AutomaticSpeechRecognition(2003)DataSetClassAClassDClassXATIS-243%33%24%ATIS-349%33%18%DataSetClassAClassDClassXATIS-243%33%24%ATIS-349%33%18%A:Context-independentqueriesD:Context-dependentqueriesX:Un-answerablequeriesATISDataCollectionStatusOver25,000utteranceswerecollected(fromAT&T,BBN,CMU,MIT,NIST,andSRI)About80%ofthecollecteddata(speechandtranscriptions)weredistributedforsystemdevelopmentandtrainingOver11,000oftrainingutteranceswereannotatedwithdatabase"reference"answerAbout40%ofthedatafromATIS-3(morecities)Introduction||NL(ATIS)||Development||Progress||ChallengesConversationalSystems236.
345AutomaticSpeechRecognition(2003)SLSDatabaseTuplesPre-recordedDataDATABASEReferenceAnswerCompareScoreEvaluationofSLSUsingCommonAnswerSpecification(CAS)Evaluationisautomatic(i.
e.
,easy),oncewehave:–Principlesofinterpretation(e.
g.
,"red-eye")–Properlyannotateddata,and–ComparatorButitiscostly,anddoesnotaddressimportantresearchissuessuchasdialoguemodelingandsystemusefulnessIntroduction||NL(ATIS)||Development||Progress||ChallengesConversationalSystems246.
345AutomaticSpeechRecognition(2003)StateoftheArt(TheATISDomain)Word(alsoutterance)errorrate(ER)forspontaneousspeechapproachingthatforreadspeechUnderstandingERM-TOM-MILWAUKEEANDTHENFROMMILWAUKEETOTACOMATHANKYOUIntroduction||NL(ATIS)||Development||Progress||ChallengesConversationalSystems266.
345AutomaticSpeechRecognition(2003)WecannotexpectanynaturallanguagesystemtobeabletofullyparseandunderstandallsuchsentencesWecannotexpectanynaturallanguagesystemtobeabletofullyparseandunderstandallsuchsentencesDifficult,ButReal,SentencesIwouldliketofindaflightfromPittsburghtoBostononWednesdayandIhavetobeinBostonbyonesoIwouldlikeaflightoutofherenolaterthan11a.
m.
I'llrepeatwhatIsaidbeforeonscenario3Iwouldlikea727flightfromWashingtonDCtoAtlantaGeorgiaIwouldlikeitduringthehoursoffrom9a.
m.
till2p.
m.
ifIcangetaflightwithinthattimeframeandIwouldlikeitforFridaySomedatabaseI'minquiringaboutafirstclassflightoriginatingcityAtlantadestinationcityBostonanyclassfarewillbeallrightIntroduction||NL(ATIS)||Development||Progress||ChallengesConversationalSystems276.
345AutomaticSpeechRecognition(2003)HistoricalPerspectiveonKeyPlayersinATISEffortCMU:Strictlysemanticgrammar,syntacticinformationmostlyignoredMIT:GrammarrulesinterleavesyntacticandsemanticcategoriesBBN,SRI:–Initialsystemsusedsyntacticgrammarsbasedonunificationframework,withparallelsemanticrules–Bothsitesnowhaveastrictlysemanticgrammaraswell–SRIcombinestwooutputsintoonesystem;BBNhasseparatecompetingsystemsATT,BBN,IBM:StochasticapproachesusingHMMIntroduction||NL(ATIS)||Development||Progress||ChallengesConversationalSystems286.
345AutomaticSpeechRecognition(2003)okaythenextuhuh(i'mgoingtoneed)a(fromdenver)(abouttwoo'clock)and(gotoatlanta)okaythenextuhuh(i'mgoingtoneed)a(fromdenver)(abouttwoo'clock)and(gotoatlanta)ExampleCMU'sApproachGrammarconsistsof~70autonomoussemanticconcepts(e.
g.
,DepartLocation)Eachconceptisrealizedasasetofpossiblewordclasssequences,e.
g.
,DepartLocation=>[FROM][LOC]whicharespecifiedthroughrecursivetransitionnetworks(RTNs)Semanticframeisaflatstructureofkey-valuepairsasdefinedbytheconceptsSyntacticstructureisignoredRecognizeronlyproducesasingletheoryIntroduction||NL(ATIS)||Development||Progress||ChallengesConversationalSystems296.
345AutomaticSpeechRecognition(2003)MIT'sApproachTINAwasdesignedforspeechunderstanding–Grammarrulesintermixsyntaxandsemantics–Probabilitiesaretrainedfromuserutterances–ParsetreeisconvertedtoasemanticframethatencapsulatesthemeaningTINAenhancesitscoveragethrougharobustparsingstrategy–Sentencesthatfailtoparsearesubjectedtoafragmentparsestrategy–Fragmentsarecombinedintoafullsemanticframe–Whenallthingsfail,resorttowordspottingIntroduction||NL(ATIS)||Development||Progress||ChallengesConversationalSystems306.
345AutomaticSpeechRecognition(2003)StochasticApproachesSemanticModelLexicalModelwhattosayhowtosayitmeaningsentenceMSChooseamongallpossiblemeaningstheonethatmaximizes:(|)()PSMPMPMSPS=HMMtechniqueshavebeenusedtodeterminethemeaningofutterances(ATT,BBN,IBM)Encouragingresultshavebeenachieved,butalargebodyofannotateddataisneededfortrainingIntroduction||NL(ATIS)||Development||Progress||ChallengesConversationalSystems316.
345AutomaticSpeechRecognition(2003)NLRe-SortNComplete"sentence"hypothesesparsablesentencesSRbestscoringhypothesisspeechshowmeflightsfrombostontodenverandshowmeflightsfrombostontodenvershowmeflightsfrombostontodenveronshowmeflightfrombostontodenverandshowmeflightfrombostontodenvershowmeflightfrombostontodenveronshowmeflightsfrombostontodenverinshowmeaflightfrombostontodenverandshowmeaflightfrombostontodenvershowmeaflightfrombostontodenveronshowmeflightsfrombostontodenverandshowmeflightsfrombostontodenvershowmeflightsfrombostontodenveronshowmeflightfrombostontodenverandshowmeflightfrombostontodenvershowmeflightfrombostontodenveronshowmeflightsfrombostontodenverinshowmeaflightfrombostontodenverandshowmeaflightfrombostontodenvershowmeaflightfrombostontodenveronAnswerSR/NLIntegrationviaN-BestInterfaceN-BestresortinghasalsobeenusedasamechanismforapplyingcomputationallyexpensiveconstraintsIntroduction||NL(SR/NLIntegration)||Development||Progress||ChallengesConversationalSystems326.
345AutomaticSpeechRecognition(2003)AnA*algorithmisoftenusedtoconstructthetop-Nsentencehypothesesf*(p)=g(p)+h*(p)where:f*(p)istheestimatedscoreofthebestpathcontainingpartialpathpg(p)isthescorefromthebeginningtotheendofthepartialpathp,andh*(p)istheestimatedscoreofthebest-scoringextensionofpQuestions:–HowcaninformationintheN-bestlistbecapturedmoreeffectivelyshowameflightsflightbostonfromdenvertoandonin##SomeIssuesRelatedtoSearch–Whataresomecomputationallyefficientchoicesofh*(p),evenifinadmissibleIntroduction||NL(SR/NLIntegration)||Development||Progress||ChallengesConversationalSystems336.
345AutomaticSpeechRecognition(2003)TighterSR/NLIntegrationNaturallanguageanalysiscanprovidelongdistanceconstraintsthatn-gramscannotExamples:–Whatistheflightservesdinner–WhatmealsdoesflighttwoservedinnerQuestion:HowcanwedesignsystemsthatwilltakeadvantageofsuchconstraintsIntroduction||NL(SR/NLIntegration)||Development||Progress||ChallengesConversationalSystems346.
345AutomaticSpeechRecognition(2003)ByintroducingNLconstraintsearlier,onecanpotentiallyreducecomputationwhileimprovingperformanceNLRe-SortparsablesentencesSRbestscoringhypothesisspeechbestpartialtheorynextwordextensionsAlternativestoN-BestInterfaceEarlyintegrationcanalsoremovetheneedforastatisticallanguagemodel,whichmaybehardtoobtainforsomeapplicationsAsthevocabularysizeincreases,wemustbegintoexplorealternativesearchstrategies–Parallelsearch–FastsearchtoreducewordcandidatelistIntroduction||NL(SR/NLIntegration)||Development||Progress||ChallengesConversationalSystems356.
345AutomaticSpeechRecognition(2003)Generatingn-gramsfromParseTreesNLUcanhelpgenerateaconsistentclassn-gramtoidaho_fallsonmaytwenty_thirdSENTENCECLARIFIERDESTINATIONDATETOCITY_NAMEMONTHONDAYCARDINAL_DATEtoidahofallsonmaytwentythirdCITY_NAMEMONTHCARDINAL_DATEDeveloperidentifiesparsecategoriesforclassn-gramSystemtagswordswithassociatedclasslabelsIntroduction||NL(SR/NLIntegration)||Development||Progress||ChallengesConversationalSystems366.
345AutomaticSpeechRecognition(2003)SomeSR/NLCouplingExperiments(ATISDomain)MIT(Goddeau,1992)–ProbabilisticLRparser–IntegratedintorecognizerA*search–AchievedcomparablerecognitionaccuracytoN-bestresorting,butwithconsiderablymoreefficiencyCMU(Ward,1993)–Modeledsemanticconceptsequencesthroughtrigram;andterminalwordsequencesthroughbigram–IntegratedintorecognizerA*search–Reducedunderstanding(CAS)errorby10%SRI(Moore,1995)–Modelledsemanticallymeaningfulfragmentsthroughtrigram;andwordclassesthrough4-gram–TheNLscoreisaddedtothebasicrecognitionscore–Achieved~15%worderrorreductionIntroduction||NL(SR/NLIntegration)||Development||Progress||ChallengesConversationalSystems376.
345AutomaticSpeechRecognition(2003)TypicalDiscoursePhenomenainConversationalSystemsDeictic(verbalpointing)andanaphoric(e.
g.
,pronominal)reference:1.
ShowmetherestaurantsinCambridge.
2.
Whatisthephonenumberofthethirdone3.
HowdoIgettherefromthenearestsubwaystopEllipsis:1.
WhendoesflighttwentytwoarriveinDallas2.
Whatisthedeparturetime()Fragments:1.
WhatistheweathertodayinDenver2.
HowaboutSaltLakeCityIntroduction||NL(Discourse)||Development||Progress||ChallengesConversationalSystems386.
345AutomaticSpeechRecognition(2003)MIT'sDiscourseModuleInternalsDISCOURSEMODULEInputFrameDisplayedListResolveDeixisIncorporateFragmentsInterpretedFrameResolvePronounsResolveDefiniteNPFillObligatoryRolesUpdateHistoryElementsIntroduction||NL(Discourse)||Development||Progress||ChallengesConversationalSystems396.
345AutomaticSpeechRecognition(2003)U:IneedaflightfromBostontoSanFranciscoC:DidyousayBostonorAustinU:Boston,MassachusettsC:IneedadatebeforeIcanaccessTravelocityU:TomorrowC:HoldonwhileIretrievetheflightsforyouC:Ihavefound10flightsmeetingyourspecification.
WhenwouldyouliketoleaveU:Inthemorning.
C:DoyouhaveapreferredairlineU:UnitedC:Ifoundtwonon-stopUnitedflightsleavinginthemorning…HelptheusernarrowdownthechoicesClarification(insufficientinfo)Clarification(recognitionerrors)Post-Retrieval:MultipleDBRetrievals=>UniqueResponseDifferentRolesofDialogueManagementPre-Retrieval:AmbiguousInput=>UniqueQuerytoDBIntroduction||NL(Dialogue)||Development||Progress||ChallengesConversationalSystems406.
345AutomaticSpeechRecognition(2003)MultipleRolesofDialogueModelingOurdefinition:Foreachturn,preparingthesystem'ssideoftheconversation,includingresponsesandclarificationsResolveambiguities–Ambiguousdatabaseretrieval(e.
g.
London,EnglandorLondon,Kentucky)–Pragmaticconsiderations(e.
g.
,toomanyflightstospeak)Informandguideuser–Suggestsubsequentsub-goals(e.
g.
,whattime)–Offerdialogue-contextdependentassistanceuponrequest–Provideplausiblealternativeswhenrequestedinformationunavailable–Initiateclarificationsub-dialoguesforconfirmationInfluenceothersystemcomponents–Adjustlanguagemodelduetodialoguecontext–Adjustdiscoursehistoryduetopragmatics(e.
g.
,NewYork)Introduction||NL(Dialogue)||Development||Progress||ChallengesConversationalSystems416.
345AutomaticSpeechRecognition(2003)AnAttractiveStrategyConductR&Dofhumanlanguagetechnologieswithinthecontextofrealapplicationdomains–Forcesusto:*Confrontcriticaltechnicalissues(e.
g.
,rejection,newwordproblem)and*Setpriorities(e.
g.
,bettermatchtechnicalcapabilitieswithusefulapplications)–Providesarichandcontinuingsourceofusefuldata*Realdatafromrealusersareinvaluable–Demonstratestheusefulnessofthetechnology–FacilitatestechnologytransferIntroduction||NL||Development||Progress||ChallengesConversationalSystems426.
345AutomaticSpeechRecognition(2003)SystemRefinementLimitedNLCapabilitiesDataCollection(Wizard)PerformanceEvaluationExpandedNLCapabilitiesSpeechRecognitionDataCollection(Wizard-less)SystemDevelopmentCycleIntroduction||NL||Development||Progress||ChallengesConversationalSystems436.
345AutomaticSpeechRecognition(2003)DataCollectionSystemdevelopmentischicken&eggproblemDatacollectionhasevolvedconsiderably–Wizard-based→system-baseddatacollection–Laboratorydeployment→publicdeployment–100sofusers→thousands→millionsDatafromrealuserssolvingrealproblemsacceleratestechnologydevelopment–Significantlydifferentfromlaboratoryenvironment–Highlightsweaknesses,allowscontinuousevaluation–But,requiressystemsprovidingrealinformation!
ExpandingcorporawillrequireunsupervisedtrainingoradaptationtounlabelleddataIntroduction||NL||Development||Progress||ChallengesConversationalSystems446.
345AutomaticSpeechRecognition(2003)Datavs.
Performance(WeatherDomain)LongitudinalevaluationsshowimprovementsCollectingrealdataimprovesperformance:–Enablesincreasedcomplexityandimprovedrobustnessforacousticandlanguagemodels–BettermatchthanlaboratoryrecordingconditionsUserscomeinallkinds051015202530354045AprMayJunJulAugNovAprNovMayErrorRate(%)110100TrainingData(x1000)WordDataIntroduction||NL||Development||Progress||ChallengesConversationalSystems456.
345AutomaticSpeechRecognition(2003)010203040506070EntireSetInDomain(ID)Male(ID)Female(ID)Child(ID)Non-native(ID)OutofDomainExpert%ErrorRateSentenceWordMaleERsarebetterthanfemales(1.
5x)andchildren(2x)Strongforeignaccentsandout-of-domainqueriesarehardExperiencedusersare5xbetterthannovicesUnderstandingerrorrateisconsistentlylowerthanSERASRErrorAnalysis(WeatherDomain)Introduction||NL||Development||Progress||ChallengesConversationalSystems466.
345AutomaticSpeechRecognition(2003)ExamplesofSpokenDialogueSystemsCanonTARSAN(Japanese)–InforetrievalfromCD-ROMInfoTalk(Cantonese)–TransitfareKDDACTIS(Japanese)–Area-codes,country-codesandtime-differenceNEC(Japanese)–TicketreservationNTT(Japanese)–DirectoryassistanceSpeechWorks(Chinese)–StockquotesToshibaTOSBURG(Japanese)–FastfoodorderingCanonTARSAN(Japanese)–InforetrievalfromCD-ROMInfoTalk(Cantonese)–TransitfareKDDACTIS(Japanese)–Area-codes,country-codesandtime-differenceNEC(Japanese)–TicketreservationNTT(Japanese)–DirectoryassistanceSpeechWorks(Chinese)–StockquotesToshibaTOSBURG(Japanese)–FastfoodorderingAsiaU.
S.
AT&THowMayIHelpYou,.
.
.
BBNCallRoutingCMUMovieline,Travel,.
.
.
ColoradoUTravelIBMMutualfunds,TravelLucentMovies,CallRouting,.
.
.
MITJupiter,Voyager,Pegasus,.
.
–Weather,navigation,flightinfoNuanceFinance,Travel,…OGICSLUToolkitSpeechWorksFinance,Travel,.
.
.
UC-BerkeleyBERP–RestaurantinformationURochesterTRAINS–SchedulingtrainsAT&THowMayIHelpYou,.
.
.
BBNCallRoutingCMUMovieline,Travel,.
.
.
ColoradoUTravelIBMMutualfunds,TravelLucentMovies,CallRouting,.
.
.
MITJupiter,Voyager,Pegasus,.
.
–Weather,navigation,flightinfoNuanceFinance,Travel,…OGICSLUToolkitSpeechWorksFinance,Travel,.
.
.
UC-BerkeleyBERP–RestaurantinformationURochesterTRAINS–SchedulingtrainsEuropeCSELT(Italian)–TrainschedulesKTHWAXHOLM(Swedish)–FerryscheduleLIMSI(French)–Flight/trainschedulesNijmegen(Dutch)–TrainschedulePhilips(Dutch,Fr.
,German)–Flight/TrainschedulesVocalisVOCALIST(English)–FlightschedulesCSELT(Italian)–TrainschedulesKTHWAXHOLM(Swedish)–FerryscheduleLIMSI(French)–Flight/trainschedulesNijmegen(Dutch)–TrainschedulePhilips(Dutch,Fr.
,German)–Flight/TrainschedulesVocalisVOCALIST(English)–FlightschedulesLarge-scaledeploymentofsomedialoguesystems–e.
g.
,CSELT,Nuance,Philips,SpeechWorksIntroduction||NL||Development||Progress||ChallengesConversationalSystems476.
345AutomaticSpeechRecognition(2003)ExampleDialogueSystemsVocabulariestypicallyhave1000sofwordsWidelydeployedsystemstendtobemoreconservativeDirecteddialogueshavefewerwordsperutteranceWordaveragesloweredbymoreconfirmationsHuman-humanconversationsusemorewords051015202530CSELTSWPhilipsCMUCMUCLIMSIMITMITCAT&THumanAveWords/UttAveUtts/CallIntroduction||NL||Development||Progress||ChallengesConversationalSystems486.
345AutomaticSpeechRecognition(2003)SomeSpeechRecognitionResearchIssuesWidespreadrobustnesstoenvironments&speakers–Channelconditions:*Wide-band→telephone→cellular*Wide-band→microphonearrays(echocancellation)–Conversationalspeechphenomena–Speakervariation(native→non-native)Knowingwhatyoudon'tknow–Confidencescoring(utterance&word)–Out-of-vocabularyworddetection&additionBeyondwordn-grams–Providingcoverage,constraint,andaplatformforunderstandingOtherchallenges:–Adaptation(long-term→short-term)–Domain-independentacousticandlanguagemodellingIntroduction||NL||Development||Progress||ChallengesConversationalSystems496.
345AutomaticSpeechRecognition(2003)LanguageUnderstandingResearchIssuesVarietyofmethodsexploredtoachieverobustunderstanding–Fullgrammarswithback-offtorobustparse(e.
g,Seneff)–Semanticgrammars,template-basedapproaches(e.
g.
,Ward)–Stochasticspeech-to-meaningmodels(e.
g.
,Miller,Levinetal.
)–Ongoingworkinautomaticgrammaracquisition(e.
g.
,Roukosetal.
,Kuhnetal.
)Interfacemechanisms–Two-stageN-best/word-graphvs.
coupledsearch–HowdoweachieveunderstandingduringdecodingOngoingchallenges:–Domain-independentlanguageunderstanding–Willcurrentapproachesscaletomorecomplexorgeneralunderstandingtasks–Integrationofmultimodalinputsintoacommonunderstandingframework(e.
g.
,Cohen,Flanagan,Waibel)Introduction||NL||Development||Progress||ChallengesConversationalSystems506.
345AutomaticSpeechRecognition(2003)SomeDialogueResearchIssuesModelinghuman-humanconversations–Arehuman-humandialoguesagoodmodelforsystems–Ifso,howdowestructureoursystemstoenablethesamekindsofinteractionfoundinhuman-humanconversationsImplementationstrategies:–Directedvs.
mixed-initiativewithback-off(e.
g.
,Lameletal.
)–Machine-learningofdialoguestrategies(e.
g.
,Levinetal.
)Handlinguserdialoguephenomena–Interruptions(viabarge-in),anaphora,ellipsis–Barge-incanincreasecomplexityofdiscourseModelingagentdialoguephenomena–Back-channel(e.
g.
,N.
Ward)Otherissues:–Detectingandrecoveringfromerrors(e.
g.
,Walkeretal.
)–MatchingcapabilitieswithexpectationsIntroduction||NL||Development||Progress||ChallengesConversationalSystems516.
345AutomaticSpeechRecognition(2003)ConclusionSpokendialoguesystemsareneeded,dueto–Miniaturizationofcomputers–Increasedconnectivity–HumandesiretocommunicateTobetrulyuseful,theseinterfacesmustbeconversationalinnature–Embodylinguisticcompetence,bothinputandoutput–HelppeoplesolverealproblemsefficientlySystemswithlimitedcapabilitiesareemergingMuchresearchremainstobedone

licloud:$39/月,香港物理服务器,30M带宽,e3-1230v3/16G内存/1T硬盘

licloud官方消息:当前对香港机房的接近100台物理机(香港服务器)进行打折处理,30Mbps带宽,低至不到40美元/月,速度快,性价比高,跑绝大多数项目都是绰绰有余了。该款香港服务器自带启动、关闭、一键重装功能,正常工作日内30~60分钟交货(不包括非工作日)。 官方网站:https://licloud.io 特价香港物理服务器 CPU:e3-1230v2(4核心、8线程、3.3GH...

hosteons:10Gbps带宽,免费Windows授权,自定义上传ISO,VPS低至$21/年,可选洛杉矶达拉斯纽约

hosteons当前对美国洛杉矶、达拉斯、纽约数据中心的VPS进行特别的促销活动:(1)免费从1Gbps升级到10Gbps带宽,(2)Free Blesta License授权,(3)Windows server 2019授权,要求从2G内存起,而且是年付。 官方网站:https://www.hosteons.com 使用优惠码:zhujicepingEDDB10G,可以获得: 免费升级10...

6元虚拟主机是否值得购买

6元虚拟主机是否值得购买?近期各商家都纷纷推出了优质便宜的虚拟主机产品,其中不少6元的虚拟主机,这种主机是否值得购买,下面我们一起来看看。1、百度云6元体验三个月(活动时间有限抓紧体验)体验地址:https://cloud.baidu.com/campaign/experience/index.html?from=bchPromotion20182、Ucloud 10元云主机体验地址:https:...

saintpaulon为你推荐
企业邮局系统什么邮件系统最适合企业?thinksns网站成功 安装ThinkSNS后主页有问题重庆电信断网为什么电信宽带突然断网了360邮箱邮箱地址指的是什么?360公司迁至天津公司名字变更,以前在北京,现在在天津,跨地区了怎么弄?资费标准中国移动4g18元套餐介绍2828商机网千元能办厂?28商机网是真的吗?灌水机谁知道哪个好点的灌水机的地址?开源网店开源网店系统 独立网店系统 淘宝 有什么区别?discuz论坛Discuz论坛是什么啊?
vps代购 域名备案信息查询 漂亮qq空间 simcentric 鲨鱼机 evssl证书 好玩的桌面 工信部icp备案号 isp服务商 33456 怎么建立邮箱 东莞idc 台湾google 宏讯 免费的asp空间 阿里dns 中美互联网论坛 apachetomcat e-mail 服务器机柜 更多