RESEARCHARTICLEOpenAccessUsinginternetsearchqueriesforinfectiousdiseasesurveillance:screeningdiseasesforsuitabilityGabrielJMilinovich1,2*,SimonMRAvril3,ArchieCAClements4,JohnSBrownstein5,ShiluTong1andWenbiaoHu1AbstractBackground:Internet-basedsurveillancesystemsprovideanovelapproachtomonitoringinfectiousdiseases.
Surveillancesystemsbuiltoninternetdataareeconomically,logisticallyandepidemiologicallyappealingandhaveshownsignificantpromise.
Thepotentialforthesesystemshasincreasedwithincreasedinternetavailabilityandshiftsinhealth-relatedinformationseekingbehaviour.
Thisapproachtomonitoringinfectiousdiseaseshas,however,onlybeenappliedtosingleorsmallgroupsofselectdiseases.
Thisstudyaimstosystematicallyinvestigatethepotentialfordevelopingsurveillanceandearlywarningsystemsusinginternetsearchdata,forawiderangeofinfectiousdiseases.
Methods:Officialnotificationsfor64infectiousdiseasesinAustraliaweredownloadedandcorrelatedwithfrequenciesfor164internetsearchtermsfortheperiod2009–13usingSpearman'srankcorrelations.
Timeseriescrosscorrelationswereperformedtoassessthepotentialforsearchtermstobeusedinconstructionofearlywarningsystems.
Results:Notificationsfor17infectiousdiseases(26.
6%)werefoundtobesignificantlycorrelatedwithaselectedsearchterm.
Theuseofinternetmetricsasameansofsurveillancehasnotpreviouslybeendescribedfor12(70.
6%)ofthesediseases.
Themajorityofdiseasesidentifiedwerevaccine-preventable,vector-borneorsexuallytransmissible;crosscorrelations,however,indicatedthatvector-borneandvaccinepreventablediseasesarebestsuitedfordevelopmentofearlywarningsystems.
Conclusions:Thefindingsofthisstudysuggestthatinternet-basedsurveillancesystemshavebroaderapplicabilitytomonitoringinfectiousdiseasesthanhaspreviouslybeenrecognised.
Furthermore,internet-basedsurveillancesystemshaveapotentialroleinforecastingemerginginfectiousdiseaseevents,especiallyforvaccine-preventableandvector-bornediseases.
BackgroundPrudentdetectionisacornerstoneinthecontrolandpreventionofinfectiousdiseases.
Traditionalinfectiousdiseasesurveillancesystemsaretypicallycharacterisedbyabottom-upprocessofdatacollectionandinforma-tionflow;thesesystemsrequireapatienttorecogniseillnessandseektreatmentandaphysicianorlaboratorytodiagnosetheinfectionandnotifytherelevantauthor-ity[1,2].
Foremerginginfectiousdiseaseevents,thisprocessisreportedtotake,onaverage,15daysfromon-settodetectionandafurther12–24hoursfortheWorldHealthOrganizationtobenotified[3].
Thedevelopmentandimplementationofmoreefficientsystemsforgath-eringintelligenceoninfectiousdiseaseshasthepotentialtoreducetheimpactofdiseaseevents.
Internet-basedsurveillancesystemsareonesuchsystem[4].
Internet-basedsurveillancesystemsproduceestimatesofdiseaseincidencethroughanalysisofvariousdigitaldata-sources.
Targetedsourcesincludeinternet-searchmetrics,onlinenewsstories,socialnetworkdataandblog/*Correspondence:gabriel.
milinovich@qut.
edu.
au1SchoolofPublicHealthandSocialWork,QueenslandUniversityofTechnology,Brisbane,Australia2InfectiousDiseaseEpidemiologyUnit,SchoolofPopulationHealth,TheUniversityofQueensland,Brisbane,AustraliaFulllistofauthorinformationisavailableattheendofthearticle2014Milinovichetal.
;licenseeBioMedCentral.
ThisisanOpenAccessarticledistributedunderthetermsoftheCreativeCommonsAttributionLicense(http://creativecommons.
org/licenses/by/4.
0),whichpermitsunrestricteduse,distribution,andreproductioninanymedium,providedtheoriginalworkisproperlycredited.
TheCreativeCommonsPublicDomainDedicationwaiver(http://creativecommons.
org/publicdomain/zero/1.
0/)appliestothedatamadeavailableinthisarticle,unlessotherwisestated.
Milinovichetal.
BMCInfectiousDiseases(2014)14:690DOI10.
1186/s12879-014-0690-1microblogdata[4].
Currently,themostpromisingap-proachappearstobethosebaseduponmonitoringofinternetsearchbehaviour.
Thisapproachworksonthepremisethatpeoplewillactivelyseekinformationondis-easestheydevelopandthatestimatesofdiseaseactivitywiththecommunitymaybedevelopedbymonitoringthefrequencyofrelatedinternetsearches.
Throughtargetingpeopleearlierinthediseaseprocess,internet-basedsystemsareabletoaccessalargerfractionofthecom-munityandproducemoretimelyinformation.
Further-more,internet-basedsurveillancesystemsareintuitiveandadaptable,cheaptorunandmaintain(onceestab-lished),donotrequireaformalpublichealthnetworkandhavethecapacitytobeautomatedandoperateinnear-realtime.
Despitetheseadvantages,internet-basedsurveillancesystemshaveanumberofsignificantshort-comingsandmustnotbeconsideredanalternativetotraditionalsurveillanceapproaches[5].
Firstly,asthesesystemscrowd-sourcedata,resolutionwillbecontin-gentonthesizeofthepopulationservicedandmaybefurtherlimitedbynationalcommunicationsinfrastructureavailabilityanddistribution[6].
Secondly,asinternet-basedsurveillancesystemsarelimitedtopeoplewhousetheinternettosourcehealthinformation,thereisthepotentialthatestimatesproducedbythesesystemsmaynotaccuratelyreflecttheentirecommunity[7].
Finally,asinternet-basedsurveillancesystemsessentiallyrelyuponself-reporting,biasmaybeintroducedthroughdifferencesininternetusagebetweensectorsofthecommunity(theelderly,forexample,maynotusetheinternetasasourceofhealthinformation,despitebeingahigh-riskgroupformanyinfectiousdiseases)and/orthroughmediadriveninterestinemergingdiseaseevents[4].
Infectiousdiseasessurveillancesystemshavebeende-velopedusinginternetsearchmetricstoestimateinci-denceofinfluenza(GoogleFluTrends)[8]anddengue(GoogleDengueTrends)[9].
Currently,operationalsys-temsthatutilisethisapproacharelimited,however,stud-iesofthepotentialforinternet-basedsurveillancehavebeenconductedforarangeofotherinfectiousdiseases,including:acuterespiratoryillness[7],AIDS[10],chicken-pox[11,12],cryptosporidiosis[13],dysentery[10],gastro-enteritis[11],Hepatitis[14],listeriosis[15],Lymedisease[16],methicillin-resistantStaphylococcusaureus[17],nor-ovirus[18],respiratorysyncytialvirus[6],rotavirus[19],scarletfever(Streptococcuspyogenes)[10,20],Salmonella[21],tuberculosis[10,22]andWestNilevirus[6].
Previousstudieshavefocusedonsinglediseases,orasmallnumberofdiseases,andthejustificationofthefocusonaparticu-lardiseasehasbeenspecifictoeachstudy.
Thepublishedresultshavelargelybeenpromising;however,todatetherehasbeennosystematic,generalizableanalysistoidentify-ingdiseasesthataresuitedtomonitoringthroughtheanalysisofinternet-searchmetrics.
Theunderpinninggoalofthisstudywastoprovidedirectionforfutureapproachestodevelopingdigitalsur-veillancesystems;suchasthedevelopmentofpredictivemodelsand/orintegrativesurveillancemodelsthatdrawuponmultipletraditionalanddigitaldatasourcetocreateestimatesofdiseasewithinthecommunity.
Thisstudy,however,didnotaimtodevelopactionablesurveillancesystems,producepredictivemodelsofinfectiousdiseasebasedoninternet-baseddataortoidentifythebestsearchtermsforuseinthesemodels.
Rather,thisstudyaimedtodeterminewhichdiseaseshavemostpromiseformonitor-ingbysurveillancesystemsbuiltoninternetsearchmet-rics;thiswasachievedbyassessingthelevelofcorrelationbetweenawiderangeofinfectiousdiseasesandinternetsearchtermmetrics.
Finally,thisstudyaimstoidentifydiseasesforwhichinternet-baseddatacouldbeusedtocreateearlywarningsystems.
MethodsInfectiousdiseasesurveillancedataSurveillancedataonnotifiableinfectiousdiseaseswerecol-lectedfromtheNationalNotifiableDiseaseSurveillanceSystem(NNDSS)whichismaintainedbytheAustraliaGovernmentDepartmentofHealth(DoH)[23].
Monthlynotifications(casenumbers)aggregatedatstate/terri-toryandnationallevel,weredownloadedfortheperiodofJanuary2004toSeptember2013.
Afulllistofnotifi-ablediseasesinAustraliaandcasedefinitionscanbeaccessedthroughtheDoHwebpage[24].
Sixty-fourdis-easesaremonitoredandthesearecategorisedintheNNDSSasbelongingtooneofeightgroups:blood-bornediseases;gastrointestinaldiseases;otherbacterialdiseases;quarantinablediseases;sexuallytransmissibleinfections;vector-bornediseases;vaccinepreventablediseases;andzoonoses.
Forthepurposeofconsistency,wehavereporteddiseasesaccordingtothesegroupings.
Whilstnotifiable,datawerenotdownloadedforhumanimmunodeficiencyvirusinfection/acquiredimmuno-deficiencysyndrome,Creutzfeldt–Jakobdiseaseorvari-antCreutzfeldt–JakobdiseasebecausesurveillanceforthesediseasesisnotperformedbyDoHorforsevereacuterespiratorysyndrome,becausereportingtotheDoHisinformal;assuch,thesediseasesarenotlistedontheNNDSS.
SearchtermselectionandscrapingofinternetsearchtrenddataIntheconstructionofGoogleFluTrendsmodel,theau-thorsidentifiedsearchtermsbyperformingcorrelationsbetweeninfluenza-likeillnessdatafromtheUSCDCandthetop50millionGooglesearchqueriesperformedintheUSoverthecorrespondingperiod[8].
Suchdataisnotavailabletothepublicandanalternativeapproachtoiden-tificationofsearchtermswasrequired;twoapproachesMilinovichetal.
BMCInfectiousDiseases(2014)14:690Page2of9wereused.
Firstlytermsrelatedtodiseases,theaetiologicalagentsandcolloquialisms(suchas"hep"forhepatitisor"flu"forinfluenza)weremanuallyidentified.
Secondly,GoogleCorrelate(www.
google.
com/trends/correlate)wasqueriedusingmonthlysurveillancedata(describedabove).
GoogleCorrelateprovidesalistofupto100searchtermsthatcorrelatemosthighlywiththequerydata.
Toaccountforpotentiallanguageshiftsthatmayhaveaffectedsearchbehaviour[4],thiswasperformedthreetimesusingsur-veillancedatacoveringtheperiods2004–13,2007–13and2011–13.
Upto300searchtermsweredownloadedfromGoogleCorrelateforeachnotifiabledisease(100searchtermsperperiodanalysed)andmanuallysorted;anytermrelatedtothequeriednotifiablediseasewasincluded,regardlessofthenatureofthepotentialassociationSuitabletermswerecombinedwiththemanuallyidenti-fiedsearchtermstocreatealistofsearchterms(seeAdditionalfile1).
Noattemptwasmadetofiltersearchtermsbaseduponbiologicalplausibility;anytermthatmaybeperceivedtohaveanyassociationwiththediseaseofinterestwasincluded.
SearchfrequenciesfortermsofinterestwerecollectedthroughGoogleTrends(www.
google.
com/trends/).
Alldataextractionswereperformedonthe22ndofOctober,2013.
GoogleTrendswasqueriedusingeachoftheiden-tifiedtermsatanationalandstate/territorylevelusingtheentiretimerangeavailable(2004–present).
GoogleTrendspresentssearchfrequencyasanormaliseddataserieswithvaluesrangingfrom0to100(with100repre-sentingthepointwiththehighestsearchfrequencyandotherpointsscaledaccordingly);functionalityforexport-ingsearchfrequencydataasa.
CSVfileisprovided.
Forthepurposeofprivacy,dataareaggregatedatadaily,weeklyormonthlylevel(orarerestrictedifthereisinsuf-ficientsearchvolume).
Thelevelofaggregationappliedisdeterminedbytheperiodanalysedandthesearchfre-quency;thelevelofaggregationisnotabletobespecifiedbytheuser.
Asthenotifiablediseasesurveillancedatausedwasinmonthlyformat,monthlyindicesofquerysearchfrequencieswererequired.
Monthlyindicesaredis-playedgraphicallybyGoogleTrendswhenqueryingpe-riodsgreaterthan36months;ratherthandownloading.
CSVfiles,ascriptwasdevelopedtoscrapedatafromtheGoogleTrendswebpage,allowingtheproblemsassociatedwiththelevelofdataaggregationtobeovercome.
DataanalysisAnalyseswereperformedatbothnationalandstatelevelsfortheperiod2009–13.
Asstate-levelsearchfrequencydatawerenotalwaysavailable,particularlyforlesscom-mondiseases(duetolowsearchfrequencyatthislevelofdisaggregation),correlationsbetweenstate-levelnotifica-tiondataandnationalsearchfrequencydatawerealsoperformed.
Owingtothelargenumberofcorrelationsperformedinthisstudy,Bonferroniadjustments[25]wereappliedtosignificancelevelsbytheequation1-(1-α)1/n;allp-valuesreportedinthisdocumentcorrespondtoone-tailedtests.
Spearman'srankcorrelationcoefficientswereusedtorankperformance.
Time-seriescrosscorrelationswereperformedtoas-sesslinearassociationsbetweendiseasenotificationsandGoogleTrendsearchindices.
CrosscorrelationswerecalculatedusinglagvaluesforGoogleTrendsdataran-gingfrom7to7.
Thisrangeallowedforassessmentofbiologicallyplausibleassociationsthatwererelevanttothedevelopmentofearlywarningsystems.
Crosscorre-lationswereperformedonnationaldatausingIBMSPSSversion21(SPSSInc;Chicago,IL,USA).
Seasonaldiffer-encingwasapplied(value1)toallanalysestoremovecyclictrends.
Whilstallavailabledata(2004–13)weredownloaded,analysesforthisstudywerefocusedonthemostrecentfiveyears(2009–13)aspreliminarydataanalysesindi-catedthatGoogleTrendsdatawerenotavailablepriorto2009fornumeroussearchterms(Figure1;panels2,4,9,12,16and17).
Additionally,shiftsinlanguageareknowntoaffectsurveillancesystemsbuiltupontextualdata[4].
Theshortenedperiod(2009–13)wasselectedtominimisetheeffectsoflanguageshifts.
However,thisperiodstillprovidestherequisite50pairsofobservationsforperformingcrosscorrelations[26].
ResultsInthissectionwediscussanalysesoftimeseriesdata.
Briefly,thetimeseriesanalysedweremonthlycasenumbersforthe64infectiousdiseasesmonitoredbytheAustralianGovernment'sNationalNotifiableDiseaseSurveillanceSystem(NNDSS)andGoogleTrendsmonthlysearchmetricsforrelatedinternetsearchterms.
Intotal,search164termswereanalysedinthisstudy;thisrangedfromasingletermforsomediseases,upto14searchtermsforinfluenzaand35searchtermsforpneumococcaldisease.
Themajorityoftermscouldbecategorisedasdiseasesoraetiologicalagents("brucellosis"or"Brucella"),colloquialisms("flu","hep"or"TB"),symptoms("cough","whitedischarge"or"cervicalmucus")ormedicationorgeneralhealth/treatmentrelatedqueries("whoopingcoughtreatment","symptomsofdengue"or"fluandpregnancy").
Afewtermsthatmayhaveenvironmental("flashfloods"forleptospirosis)orbehavioural("Africantours"formal-aria)meaningswerealsoincluded.
Afulllistofthesearchtermsanalysedispresentedinthesupplementarymaterial.
Spearman'scorrelationsEvaluationofthebivariateassociationsbetweensurveil-lanceandcorrespondingsearchfrequencydatawasper-formedusingtheSpearman'srankcorrelation.
Spearman'srankcorrelationsforthe18toprankednotifiablediseasesMilinovichetal.
BMCInfectiousDiseases(2014)14:690Page3of9Figure1Topinternetsearchtermsanalysedfor18diseaseswiththehighestSpearman'srhovalues(2009–13).
Nationalmonthlycasenumbers(blue)andAustralianGoogleTrendsearchindex(red).
GoogleTrendsearchtermsusedintheanalysisarepresentedinFigure2.
Milinovichetal.
BMCInfectiousDiseases(2014)14:690Page4of9andtermsarepresentedinFigure2andrawdataforthecorrespondingdiseasesandsearchtermsarepresentedinFigure1.
ResultsofSpearman'scorrelationsindicated17diseasestobesignificantlycorrelated(pGoogleTrends'data)hasbeenshiftedbackwardsoneunit(amonth).
Conversely,alagvalueof1indicatesthattheprimaryserieshadbeenshiftedforwardoneunit.
Signifi-cantpositivecorrelationsforlagvalesof≥1oraboveareofmostinterestinthecontextofthisstudyastheyindicateapositiverelationshipbetweenthetwotimeserieswithGoogleTrendsdataleadingthenotifications(apre-requisiteforGoogleTrendsdatatobeasuitableearlywarningtool).
Itshouldalsobenotedthatseasonaldiffer-encingwasappliedtocrosscorrelationstoremovecyclicseasonaltrends.
Diseasenotificationspositivelycorrelatedatalagofonemonth(lag1)withsearchtermfrequencyfor12ofthe17diseasesthatexhibitedsignificantSpearman'srankcorrelations.
Overall,15ofthe64notifiablediseasesexhibitedsignificant,positivecorrelationsatlagofonemonth.
Significantpositiveassociationswereobservedforfouroftheninevector-bornediseases(BarmahForestvirusinfection,Denguevirusinfection,MurrayValleyencephalitisvirusinfectionandRossRivervirusinfection),sixofthe14vaccinepreventablediseases(Haemophilusinfluenzaetypeb,influenza,pertussis,pneumococcaldiseaseandvaricellazoster(chickenpoxandshingles)),twoofthesixblood-bornediseases(hepatitisB(unspecified)andC(unspecified)),twoof11gastrointestinaldiseases(campylobacteriosisandcryptosporidiosis)andonezoonosis(leptospirosis).
Positivesignificantcorrelationswerenotobservedatalagofonemonthforanyofthequarantinablediseases(n=6),sexuallytransmissibleinfections(n=6)orotherbacterialinfections(n=4).
Itshouldbenotedthatposi-tivesignificantcorrelationswereobservedatlagsofoveronemonth(butnotatlag1)fortwoofthetopranked18diseases(gonococcalinfectionandmeningo-coccaldisease)and16diseasesoverall(seeAdditionalfile1).
Additionally,theterms"haemolyticuraemicsyndrome"and"leprosy"exhibitedsignificantnegativecorrelationswiththerespectivediseasenotificationsatalagofonemonth.
Figure2Spearman'srhovaluesforthe18toprankednotifiablediseasesfortheperiod2009–13.
Thetableonlycontainsthesearchtermwiththehighestdegreeofcorrelationforeachdisease;seeAdditionalfile1forafulllistofdiseases,searchtermsandcorrelationcoefficients.
ThecolumnlabelinboldindicatestheGoogleTrendsdatausedandsubheadingsinitalicsindicatethediseasenotificationdataused.
CasenumbersareNationaltotalsfortheperiod2009–13.
Shadingdenotedstatisticalsignificance(one-tailed,Bonferronicorrected)at0.
0001(red),0.
001(orange),0.
01(yellow)and0.
05(green)levels.
Fordiseasegrouping,BB:Blood-bornediseases;GI:Gastrointestinaldiseases;Other;Otherbacterialdiseases;QD;Quarantinablediseases;STI:SexuallyTransmissibleInfections;VBD:Vector-borneDiseases;VPD:Vaccinepreventablediseases;Zoo:Zoonoses.
Milinovichetal.
BMCInfectiousDiseases(2014)14:690Page5of9Figure3(Seelegendonnextpage.
)Milinovichetal.
BMCInfectiousDiseases(2014)14:690Page6of9DiscussionThedevelopmentandapplicationofinternet-basedinfec-tiousdiseasesurveillancesystemshasthepotentialtoenhanceinfectiousdiseasecontrolandprevention.
Whilstthisiswidelyrecognised[4,6,7,12,15,16,18,20]theinvesti-gationandapplicationofinternet-basedsurveillancehasnotbeensystematicallyappliedacrossinfectiousdiseases;thelackofsystemicknowledgeregardingthepotentialbreadthofinternet-basedsurveillanceappearstohaverestrictedthedevelopmentofsystemstoasmallnumberofdiseases.
Toourknowledge,assessmentsoftheuseofinternet-basedsurveillancehaveonlybeenperformedforfiveofthe17diseasesthatweredemonstratedtohaveasignificantassociationwithinternetsearchterms(influ-enza[4],dengue[9,27],chickenpox[11,12],hepatitisB[14]andcryptosporidiosis[13]–theauthorsofthefinalstudywere,however,notabletodetectsignalsfrominternetsearchqueries).
Ourstudysuggeststhatinternet-basedsurveillancesystemshavepotentialapplicationtoawiderrangeofdiseasesthaniscurrentlyrecognised.
How-ever,correlationsaloneshouldnotbeviewedasdefinitiveevidencethatsuchsystemsareviable;somediscretionmustbeapplied,particularlyastheanalysesperformedwereunivariate.
Correlationsbetweeninternetmetricsandbothgonococcalinfectionandchlamydia(Figure1,boxes2and7)werehigh;thisappearstobeduetoagen-eralupwardtrendinbothandinternetmetricsappearstohavelittlevalueindetectingperturbationsincasesbeyondthis.
Thisissupportedbythecrosscorrelationresults(whichareseasonallydifferenced);despitebeingranked2ndand7thbySpearmanrho(Figure2),nopositivecorrelationswereobservedforthesedisease/searchtermcrosscorrelations,evenatlag0(Figure3).
Furtherre-searchneedstobeperformed;however,thisstudysug-gestssurveillancesystemsbuildoninternetsearchdatatohavesignificantpromiseforanumberofdiseasesbeyondthosepreviouslydescribed,mostnotablypneumococcaldisease,RossRivervirusinfection,pertussis,BarmahForestvirusandinvasivemeningococcaldisease.
Theapplicationofinternet-baseddatatomonitoringsystemsofinteresthasbeentermed"nowcasting";thisapproachdoesnotpredicttheoccurrenceoffutureevents,butratherseekstoproducemoretimelyinformationonthesystemsofinterest[28].
Forinfectiousdiseasesurveil-lance,thisistypicallyachievedthroughtheabilityofinternet-basedsurveillancesystemstocollectdataatanearliertimepointthanispossiblefortraditionalsystemsorbycircumventingbureaucraticstructuresinherenttotraditionalsystemsthatimpedeinformationflow[4].
Searchtermsthatexhibitahighlevelofcorrelationwithdiseasenotificationsareofvalueastheymaybeusedtoprovidefasterintelligenceonemergingdiseaseevents.
Resultsofcrosscorrelations(Figure3),however,indi-catedthatforecastingofinfectiousdiseaseeventsmayalsobepossibleusinginternet-baseddata.
Ofthe17dis-easesthatexhibitedsignificantSpearman'scorrelations,12alsohadsignificantpositivecrosscorrelationsatalagofonemonth.
Overall,crosscorrelationsindicatedthatforecastingofnotificationratesusinginternet-basedmet-ricswouldbemostrealisticforthevaccine-preventableandvector-bornediseases.
Despitesearchtermsofferingstrongorverystrongcorrelationsfortwoofthesexuallytransmissiblediseases,neitherexhibitedsignificantcorre-lationsatalagofonemonth.
Whilstinternetmetricsmayprovidevaluableinforma-tionregardingdiseasestatus,itisimportanttoviewthesewithincontext.
Theterm"denguemosquito"(Figure3,panel6)leadsnotificationsbyuptoonemonth.
Thedataimplydependenceofdenguenotificationsonsearchesfortheterm"denguemosquito".
Themechanismofthisde-pendenceismorelikelythatenvironmentalconditionsthatincreasetheabundanceofmosquitosindengueriskareascorrelatewithbothanincreaseindenguenotifica-tionsandincreasedsearchinterestfor"denguemosquito",allowingthesearchtermtobeusedasanindicatorforno-tifications.
Inthiscontexttheinternetmetricsalsoprovideinformationthatisofpotentialsignificancewithrespecttocontrolofdenguefever;thereisincreasedinterestre-gardingmosquitosinthecommunityandthismaybedrivenbyanincreaseinmosquitonumbers.
Converselytheincidenceofdiseaseinthecommunitymayalsoaffectsearchhabits.
Thesearchterm"chikungunya"lagsnotifi-cationsforchikungunyavirusinfection(Figure3,panel18).
Searchesfor"chikungunya"areprobablydrivenbymediaexposure.
Mediabiashaspreviouslybeenreportedtoadverselyaffectinternet-basedsurveillancesystems[27,29-33]andanincreaseincasesofadiseaseinthecommunitywilllikelyresultinthepublicationofstoriesaboutthediseaseinthemedia;inturn,mediaexposurewilldriveinternetsearchesonthetopic.
Theseprocesses,however,arenotnecessarilymutuallyexclusive.
Searchesforadiseasemayleadnotifications,however,increasednotificationsandreportingofanemergingdiseaseeventinthemediamayalsodriveinternetsearches.
Thecom-plexityofthisrelationshipmaymakeinterpretationofGoogleTrends'datamoredifficult.
Forpertussis(Figure3,(Seefigureonpreviouspage.
)Figure3Crosscorrelationresultsforthe18diseaseswiththehighestSpearman'srhovalues(2009–13).
Crosscorrelationsfortwosearchtermsaredisplayedforeachdisease.
ColouredbarscorrespondtothesearchtermwiththehighestSpearman'srhovalueforeachdisease(redbarsindicatevaluesthatexceedthe95%confidenceinterval,whereasbluebarsdonot).
Unfilledbarsindicatecrosscorrelationresultsforalternativesearchtermswithhighestcrosscorrelationvaluesatalagvalueof1.
Confidenceintervals(95%)areindicatedbythegreylines.
Milinovichetal.
BMCInfectiousDiseases(2014)14:690Page7of9panel8),theterm"whooping"exhibitsasignificantposi-tivecorrelationwithdiseasenotificationsfromlag7throughtolag3.
Itappearsthatbothmechanismsoccurforthesameterm,demonstratingapotentialdifficultyininterpretingthesedata.
Itisimperativethatanytermsusedinthedevelopmentofforecastingmodelsareheav-ilyscreenedtoaddressthecomplexitiesofthedrivingforcesbehindhealth-informationseekingandroutinelyre-evaluatedtoaccountforanyshiftsinsearchbehav-iourwhichmayoccur[4].
Therewereanumberofobviouslimitationstothisstudy.
Thetemporalresolutionofthedatausedwasmonthly.
Internet-basedsurveillancesystemsbuiltuponmonthlydataareunlikelytoprovidebetterintelligencethanexistingtraditionalsurveillancesystems;thesecom-monlyrelyuponweeklyordailyreporting.
Thiswasafunctionoftheavailabilityofthenotificationdata.
Sec-ondly,theanalyseswereperformedforaspecificsetting:Australia.
Thenuancesoflanguagewillcreatediffer-encesintheapplicability,notjustfordifferentcountries,butalsowithinacountryandbetweendifferentsettings(suchasduringaninfluenzapandemic)[4].
Australiawasselectedasthestudyareabecauseinternetpenetra-tioninAustraliaisveryhigh(>80%)[34]anduseislargelyrestrictedtoasinglesearchengine;Googlemaintainsamarketshareofover90%inAustralia[35].
Thesefeaturesreducebiasesassociatedwithunequalpatternsofuseand/oraccess.
Additionally,owingtoitsextensivesize,Australiaexhibitsarangeofclimatesandvaryingenviron-mentalconditions,makingitsusceptibletoawiderangeofinfectiousdiseases,includingendemicandnon-endemicvector-bornediseases.
Additionally,Australiahasastrongpublichealthnetworkandcomprehensiveinfec-tiousdiseasesurveillancesystemswhichcompilehighqualitydataonarangeofdiseases.
Combined,thesefea-turesofinternetusageandavailability,infectiousdiseasesurveillancesystemsanddiseasessusceptibilitypatternsmakeAustraliaanidealsysteminwhichtostudythepo-tentialapplicationofinternet-basedsurveillancesystems.
Itishopedthatthisworkwillstimulatefurtherresearchintointernet-basedinfectiousdiseasesurveillancesystemsbeyondAustralia.
Evenwithinourownstudy,however,weobservedvariationincorrelationsbetweeninternetsearchmetricsanddiseasenotificationsforthevariousstates(Figure2).
Itisimperativetodevelopmodelsspecifictotheregionofinterestandtoassesstheperformanceofanyinternet-basedsystemagainsttraditionalsurveillancedataspecifictotheregionbeingmonitored.
Thirdly,thisstudyanalysedtheperformanceofonlysinglesearchtermsinestimatinginfectiousdiseasenotifications.
WhilstGooglehasnotrevealedthetermsutilised,ortheweightingsapplied,GoogleFluTrendsisreportedtoincorporatearound160searchterms[36].
Despiteusingonlyasinglesearchtermforeachanalysis,notificationsfor13diseaseswereidentifiedashavingastrongorverystrongcorrel-ationwiththeselectedsearchterms.
CompoundingthisisthefactthatBonferroniadjustmentswereappliedinasses-singsignificance.
BonferroniadjustmentshavepreviouslybeencriticisedforbeingoverlyconservativeandforincreasingtheoccurrenceoftypeIIerrors(falsenegatives)[25].
Assuch,whilstthisstudyprovidesabaseforfutureresearch,itwouldberemisstolimitfutureinvestigationstojustthesediseases.
Thisstudyidentifiednumerousinfectiousdiseasesofpublichealthsignificancethathadnotpreviouslybeenin-vestigatedtohavepotentialformonitoringusinginternet-basedsurveillancesystemsHowever,thisstudydidnotseektoproducerobust,accurate,internet-basedsurveil-lancesystemsorearlywarningsystemsthatareabletoproduceactionableandtimelydataforpublichealthunits.
Theaimofthisstudywastoidentifythediseasesforwhichthisispossibleandtofocusfutureresearcheffortsintothese.
Toachievethisaim,thisstudyusedunivariateanalysestodeterminetheusefulnessofinternetsearchmetricsformonitoringawiderangeofinfectiousdiseases.
Whilstthissimplisticapproachwasusefulforscreeningdiseases,itwillnotsufficeinmonitoringorforecastingincidence.
Futurestudiesshouldfocusondevelopingcompositeindexesincorporatemultiplesearchterms,ordatasources(suchasweatherdata).
Modelsbuiltinsuchamanneraremoreresilienttomedia-drivenbe-haviour,fear-basedsearchingandevolutionsinlanguage[4].
Internet-basedsurveillancesystemshavethepoten-tialtobeappliedtomorethanjustenumeratingdiseasecaseswithinthecommunityorpredictingtheonset,peakandmagnitudeofoutbreaks.
Internet-basedsys-temsalsohavevalueastoolsforplanningemergencydepartmentstaffingandsurgecapacity[31,37]orforhealthcareutilisation[38].
Futureresearchneedstoalsoinvestigatetoapplicationofinternet-baseddata;thegreatestchallengeinthisfieldmaynotactuallybecreat-ingmodelsforforecastingormonitoringdiseasewithinthecommunity,butratherapplyingandarticulatingthesignificanceinamannerthatisbeneficial.
ConclusionsInternet-basedsurveillancesystemshavebroaderapplic-abilityforthemonitoringofinfectiousdiseasesthaniscurrentlyrecognised.
Furthermore,internet-basedsur-veillancesystemshaveapotentialroleinforecastingofemerginginfectiousdiseaseevents.
AdditionalfileAdditionalfile1:CompletetablesofresultsforGoogleCorrelateSearches,GoogleTrendsdata,SpearmanCorrelationsandcrosscorrelations.
Milinovichetal.
BMCInfectiousDiseases(2014)14:690Page8of9CompetinginterestsTheauthorsdeclarethattheyhavenocompetinginterests.
Authors'contributionsGJMandWHdevelopedtheoriginalideaforthisstudy.
DevelopmentofthescriptfordatacollectionwasperformedbySMRA.
DataanalysiswasperformedbyGJMwiththeassistanceofWH,JSB,STandACAC.
ThemanuscriptwasprimarilywrittenbyGJMwitheditorialadvicefromWH,SMRA,JSB,STandACAC.
Allauthorsreadandapprovedthefinalmanuscript.
AcknowledgmentsThesalaryforGJMwasprovidedthroughtheAustralianNationalHealthandMedicalResearchCouncil(grant#1002608)andtheAustralianResearchCouncil(grant#DP110100651).
ACACisfundedbyanAustralianNationalHealthandMedicalResearchCouncilSeniorResearchFellowship(#APP1058878).
JSBissupportedbygrant5R01LM010812-04fromtheNationalLibraryofMedicine.
WHisfundedbyaQueenslandUniversityofTechnologyVice-ChancellorSeniorResearchFellowship.
STisfundedbyaNHMRCSeniorResearchFellowship(#553043).
Authordetails1SchoolofPublicHealthandSocialWork,QueenslandUniversityofTechnology,Brisbane,Australia.
2InfectiousDiseaseEpidemiologyUnit,SchoolofPopulationHealth,TheUniversityofQueensland,Brisbane,Australia.
3Freelancedeveloper,Bundaberg,Australia.
4ResearchSchoolofPopulationHealth,ANUCollegeofMedicine,BiologyandEnvironment,TheAustralianNationalUniversity,Canberra,Australia.
5DepartmentofPediatrics,HarvardMedicalSchoolandChildren'sHospitalInformaticsProgram,BostonChildren'sHospital,Boston,USA.
Received:5December2014Accepted:9December2014References1.
Castillo-SalgadoC:Trendsanddirectionsofglobalpublichealthsurveillance.
EpidemiolRev2010,32(1):93–109.
2.
ZengX,WagnerM:Modelingtheeffectsofepidemicsonroutinelycollecteddata.
JAmMedInformAssoc2002,9:S17–S22.
3.
ChanEH,BrewerTF,MadoffLC,PollackMP,SonrickerAL,KellerM,FreifeldCC,BlenchM,MawudekuA,BrownsteinJS:Globalcapacityforemerginginfectiousdiseasedetection.
ProcNatlAcadSciUSA2010,107(50):21701–21706.
4.
MilinovichGJ,WilliamsGM,ClementsACA,HuW:Internet-basedsurveillancesystemsformonitoringemerginginfectiousdiseases.
LancetInfectDis2014,14(2):160–168.
5.
LazerD,KennedyR,KingG,VespignaniA:Bigdata.
TheparableofGoogleFlu:trapsinbigdataanalysis.
Science2014,343(6176):1203–1205.
6.
CarneiroHA,MylonakisE:Googletrends:aweb-basedtoolforreal-timesurveillanceofdiseaseoutbreaks.
ClinInfectDis2009,49(10):1557–1564.
7.
ValdiviaA,Lopez-AlcaldeJ,VicenteM,PichiuleM,RuizM,OrdobasM:MonitoringinfluenzaactivityinEuropewithGoogleFluTrends:comparisonwiththefindingsofsentinelphysiciannetworks-resultsfor2009–10.
Eurosurveillance:bulletineuropeensurlesmaladiestransmissibles=Europeancommunicablediseasebulletin2010,15(29):pii=19621.
8.
GinsbergJ,MohebbiMH,PatelRS,BrammerL,SmolinskiMS,BrilliantL:Detectinginfluenzaepidemicsusingsearchenginequerydata.
Nature2009,457(7232):1012–1014.
9.
ChanEH,SahaiV,ConradC,BrownsteinJS:Usingwebsearchquerydatatomonitordengueepidemics:anewmodelforneglectedtropicaldiseasesurveillance.
PLoSNeglTropDis2011,5(5):e1206.
10.
ZhouXC,ShenHB:Notifiableinfectiousdiseasesurveillancewithdatacollectedbysearchengine.
JZhejiangUniv-SCIC2010,11(4):241–248.
11.
PelatC,TurbelinC,Bar-HenA,FlahaultA,ValleronA:MorediseasestrackedbyusingGoogletrends.
EmergInfectDis2009,15(8):1327–1328.
12.
ValdiviaA,Monge-CorellaS:DiseasestrackedbyusingGoogletrends,Spain.
EmergInfectDis2010,16(1):168.
13.
AnderssonT,BjelkmarP,HulthA,LindhJ,StenmarkS,WiderstromM:Syndromicsurveillanceforlocaloutbreakdetectionandawareness:evaluatingoutbreaksignalsofacutegastroenteritisintelephonetriage,web-basedqueriesandover-the-counterpharmacysales.
EpidemiolInfect2014,142(2):303–313.
14.
ZhouX,LiQ,ZhuZ,ZhaoH,TangH,FengY:Monitoringepidemicalertlevelsbyanalyzinginternetsearchvolume.
IEEETransBiomedEng2013,60(2):446–452.
15.
WilsonK,BrownsteinJS:Earlydetectionofdiseaseoutbreaksusingtheinternet.
CanMedAssocJ2009,180(8):829–831.
16.
SeifterA,SchwarzwalderA,GeisK,AucottJ:Theutilityof"Googletrends"forepidemiologicalresearch:Lymediseaseasanexample.
GeospatHealth2010,4(2):135–137.
17.
DukicVM,DavidMZ,LauderdaleDS:Internetqueriesandmethicillin-resistantstaphylococcusaureussurveillance.
EmergInfectDis2011,17(6):1068–1070.
18.
DesaiR,HallAJ,LopmanBA,ShimshoniY,RennickM,EfronN,MatiasY,PatelMM,ParasharUD:NorovirusdiseasesurveillanceusingGoogleinternetquerysharedata.
ClinInfectDis2012,55(8):E75–E78.
19.
DesaiR,LopmanBA,ShimshoniY,HarrisJP,PatelMM,ParasharUD:UseofinternetsearchdatatomonitorimpactofrotavirusvaccinationintheUnitedStates.
ClinInfectDis2012,54(9):e115–e118.
20.
SamarasL,Garcia-BarriocanalE,SiciliaMA:SyndromicsurveillancemodelsusingWebdata:thecaseofscarletfeverintheUK.
InformHealthSocCare2012,37(2):106–124.
21.
BrownsteinJS,FreifeldCC,MadoffLC:Digitaldiseasedetection–harnessingtheWebforpublichealthsurveillance.
NEnglJMed2009,360(21):2153–2155,2157.
22.
ZhouX,YeJ,FengY:TuberculosissurveillancebyanalyzingGoogletrends.
IEEETransBiomedEng2011,58(8):2247–2254.
23.
NationalNotifiableDiseasesSurveillanceSystem.
[http://www9.
health.
gov.
au/cda/source/cda-index.
cfm]24.
Australiannationalnotifiablediseasesandcasedefinitions.
[http://www.
health.
gov.
au/internet/main/publishing.
nsf/Content/cdna-casedefinitions.
htm]25.
PernegerTV:What'swrongwithBonferroniadjustments.
BMJ:BritishMedicalJournal1998,316(7139):1236.
26.
BoxGE,JenkinsGM,ReinselGC:TimeSeriesAnalysis:ForecastingandControl.
NewJersey:Wiley;2008.
27.
AlthouseBM,NgYY,CummingsDA:Predictionofdengueincidenceusingsearchquerysurveillance.
PLoSNeglTropDis2011,5(8):e1258.
28.
ChoiHY,VarianH:PredictingthepresentwithGoogletrends.
EconRec2012,88:2–9.
29.
HulthA,RydevikG:Webquery-basedsurveillanceinSwedenduringtheinfluenzaA(H1N1)2009pandemic,April2009toFebruary2010.
Eurosurveillance:bulletineuropeensurlesmaladiestransmissibles=Europeancommunicablediseasebulletin2011,16(18):pii=19856.
30.
OrtizJR,ZhouH,ShayDK,NeuzilKM,FowlkesAL,GossCH:MonitoringinfluenzaactivityintheUnitedStates:acomparisonoftraditionalsurveillancesystemswithGoogleFlutrends.
PLoSOne2011,6(4):e18687.
31.
DugasAF,HsiehYH,LevinSR,PinesJM,MareinissDP,MoharebA,GaydosCA,PerlTM,RothmanRE:GoogleFlutrends:correlationwithemergencydepartmentinfluenzaratesandcrowdingmetrics.
ClinInfectDis2012,54(4):463–469.
32.
WattsG:Googlewatchesoverflu.
BMJ(Clinicalresearched)2008,337:a3076.
33.
McDonnellWM,NelsonDS,SchunkJE:Shouldwefear"flufear"itselfEffectsofH1N1influenzafearonEDuse.
AmJEmergMed2012,30(2):275–282.
34.
WorldTelecommunication/ICTIndicatorsDatabase2013(17thEdition).
[http://www.
itu.
int/en/ITU-D/Statistics/Pages/publications/wtid.
aspx]35.
StatCounterGlobalStats-Top5seachenginesinAustraliafrom2008to2013.
[http://gs.
statcounter.
com/#search_engine-AU-yearly-2008-2013]36.
CookS,ConradC,FowlkesAL,MohebbiMH:AssessingGoogleflutrendsperformanceintheUnitedStatesduringthe2009influenzavirusA(H1N1)pandemic.
PLoSOne2011,6(8):e23610.
37.
ArazOM,BentleyD,MuellemanR:UsingGoogleFluTrendsDatainForecastingInfluenza-Like–IllnessRelatedEmergencyDepartmentVisitsinOmaha,Nebraska.
TheAmericanjournalofemergencymedicine2014,InPress.
38.
SchusterNM,RogersMA,McMahonLFJr:Usingsearchenginequerydatatotrackpharmaceuticalutilization:astudyofstatins.
AmJManagCare2010,16(8):e215–e219.
Milinovichetal.
BMCInfectiousDiseases(2014)14:690Page9of9
数脉科技怎么样?数脉科技品牌创办于2019,由一家从2012年开始从事idc行业的商家创办,目前主营产品是香港服务器,线路有阿里云线路和自营CN2线路,均为中国大陆直连带宽,适合建站及运行各种负载较高的项目,同时支持人民币、台币、美元等结算,提供支付宝、微信、PayPal付款方式。本次数脉科技给发来了新的7月促销活动,CN2+BGP线路的香港服务器,带宽10m起,配置E3-16G-30M-3IP,...
2021年各大云服务商竞争尤为激烈,因为云服务商家的竞争我们可以选择更加便宜的VPS或云服务器,这样成本更低,选择空间更大。但是,如果我们是建站用途或者是稳定项目的,不要太过于追求便宜VPS或便宜云服务器,更需要追求稳定和服务。不同的商家有不同的特点,而且任何商家和线路不可能一直稳定,我们需要做的就是定期观察和数据定期备份。下面,请跟云服务器网(yuntue.com)小编来看一下2021年国内/国...
10gbiz怎么样?10gbiz 美国万兆带宽供应商,主打美国直连大带宽,真实硬防。除美国外还提供线路非常优质的香港、日本等数据中心可供选择,全部机房均支持增加独立硬防。洛杉矶特色线路去程三网直连(电信、联通、移动)回程CN2 GIA优化,全天低延迟。中国大陆访问质量优秀,最多可增加至600G硬防。香港七星级网络,去程回程均为电信CN2 GIA+联通+移动,大陆访问相较其他香港GIA线路平均速度更...
googlepr值为你推荐
2020年《腾讯广告服务商-申请提报资料》yw372:Com帮个忙 这个视频源地址怎么找http://video.kuaiji.com/congye/diansuanhua/372/3097phpadmin下载phpMyAdmin 软件下载地址centos6.5centos 6.5 无法启动了,不知道是哪里的问题。asp.net网页制作如何用ASP.NET做网站?重庆网站制作重庆网站制作哪家好,重庆做网站制作的公司有谁比较了解的,应该去哪里做好些?支付宝注册网站支付宝申请流程是怎么样的??碧海银沙网怎样在碧海银沙网里发布图片?三五互联科技股份有限公司厦门三五互联科技股份有限公司 怎么样?即时通EC营销即时通是什么?做什么的?
二级域名 过期域名查询 国内vps site5 mediafire 好看的桌面背景图片 lamp配置 免费ftp站点 网通ip 智能骨干网 免费mysql ftp教程 howfile 昆明蜗牛家 绍兴电信 银盘服务 视频服务器是什么 深圳域名 万网注册 国外网页代理 更多