recommendations37

yw372:Com  时间:2021-02-13  阅读:()
DISCOVERYANDANALYSISOFWEBUSAGEMININGMARATHEDAGADUMITHARAMR.
C.
PatelA.
C.
S.
College,Shirpur,Maharashtra,IndiaABSTRACTInthispaperwedescribesomeofthemostcommontypesofpatterndiscoveryandanalysistechniquesemployedintheWebusagemining.
InthispapermentionAssociationandClusterAnalysis.
AssociationRuleisafundamentalofDataminingtask.
Itsobjectivetofindallco-occurrencerelationshipcalled,Associationamongdataitem.
LetI={i1,i2,…,im}beasetofitems.
LetT=(t1,t2,…,tn)beasetoftransactions.
ClusteranalysisandvisitorssegmentationClusteringisadataminingtechniquethatgroupstogetherasetofitemshavingsimilarcharacteristics.
Intheusagedomain,therearetwokindsofinterestingclustersthatcanbediscovered:userclustersandpageclusters.
GoalDiscoveryandanalysisofwebusagepatternsusingAssociationanalysis.
DiscoveryandanalysisofwebusagepatternsusingClusterAnalysisandVisitorssegmentation.
KEYWORDS:AssociationAnalysis,ClusterAnalysisandVisitorsSegmentationINTRODUCTIONAssociationrulediscoveryandstatisticalcorrelationanalysiscanfindgroupsofitemsorpagesthatarecommonlyaccessedorpurchasedtogether.
AssociationbasedonApriorialgorithm.
Thisalgorithmfindsgroupsofitemusingsupportandconfidence.
Satisfyingauserspecifiedminimumsupportthreshold.
Suchgroupsofitemsarereferredtoasfrequentitemsets&frequentitemsetsgraph.
Logfilesgeneratedbywebserverscontainenormousamountsofwebusagedatathatispotentiallyvaluableforunderstandingthebehaviorofwebsitevisitors.
Clusteringofuserrecords(sessionsortransactions)isoneofthemostcommonlyusedanalysistasksinWebusageminingandWebanalytics.
Clusteringofuserstendstoestablishgroupsofusersexhibitingsimilarbrowsingpatterns.
Suchknowledgeisespeciallyusefulforinferringuserdemographicsinordertoperformmarketsegmentationine-commerceapplicationsorprovidepersonalizedWebcontenttotheuserswithsimilarinterests.
Furtheranalysisofusergroupsbasedontheirdemographicattributes(e.
g.
,age,gender,incomelevel,etc.
)mayleadtothediscoveryofvaluablebusinessintelligence.
Usage-basedclusteringhasalsobeenusedtocreateWeb-based"usercommunities"reflectingsimilarinterestsofgroupsofusers,andtolearnusermodelsthatcanbeusedtoprovidedynamicrecommendationsinWebpersonalizationapplications.
ASSOCIATIONRULESupport&ConfidenceTheSupportofrule,XYthepercentageoftransactioninTthatcontainsXUY.
nisthenumberoftransactioninT.
Supportisusefulmeasurementofitemsetoritems.
IfXistruethenchecksforY,ifXisfalsethennothingtobesayY.
InthefollowingexampleXunionYthencount.
InternationalJournalofComputerScienceEngineeringandInformationTechnologyResearch(IJCSEITR)ISSN2249-6831Vol.
3,Issue1,Mar2013,313-320TJPRCPvt.
Ltd.
314MaratheDagaduMitharame.
g.
(XUY).
CountSupportN(XUY).
CountConfidenceX.
CountUsingaboveexampleswecanaccepttheminsubandminconf.
Tocalculateminsubandminconfasfollows.
T1C++,JAVA,RUBYT2C++,ASPT3ASP,VBT4C++,JAVA,ASPT5C++,JAVA,PHP,ASP,RUBYT6JAVA,PHP,RUBYT7JAVA,RUBY,PHPJAVA,PHPRUBY[sup=3/7,conf=3/3]Inabove7transactionsJAVA,PHP&RUBYshow3/7times.
EveryitemchecksitemsettoeveryusingJoiningandPruningsteps.
Inwebusageminingsuchrulecanbeusetooptimizestructureofwebsite.
e.
g.
Language,/product/softwareRCPACSCOLLEGEWebsiteEXPERIMENT-FINDINGWEBUSAGEASSOCIATIONRULESInstances:14Attributes:5outlooktemperatureDiscoveryandAnalysisofWebUsageMining315humiditywindyplayIfchecksunny,falseyes[sub1/14conf1/1]Thepurposeofthisexperimentwastogivesomeinsightintotheusefulnessofassociationruleswhentheyareappliedtotheweblogdatasetofaneducationinstitutionandothers.
Weexpectedtofindrulesthatcorrelatetowebpagesthatcontaininformationaboutsunny,rainyortemperatureetc.
SupposethisistransactiontableandfindoutFrequentItemsetthen,T1C++,JAVA,RUBYT2C++,ASPT3ASP,VBT4C++,JAVA,ASPT5C++,JAVA,PHP,ASP,RUBYT6JAVA,PHP,RUBYT7JAVA,RUBY,PHPSize1Size2Size3Size4ItemSetSupp.
ItemSetSupp.
ItemSetSupp.
ItemSetSupp.
C++4C++,JAVA3C++,JAVA,RUBY2C++,JAVA,RUBY,ASP1JAVA5C++,RUBY2C++,JAVA,ASP2C++,JAVA,RUBY,PHP1RUBY4C++,ASP3JAVA,RUBY,ASP1ASP4C++,PHP1JAVA,RUBY,PHP3VB1JAVA,RUBY4RUBY,ASP,PHP1PHP3JAVA,ASP2JAVA,PHP3RUBY,ASP1RUBY,PHP3ASP,PHP1Figure1:WebTransactionsandResultingFrequentItemsets(Minsup=1)FindoutFrequentItemsetbyUsingJoiningandPruningMethodsofAssociationRuleFREQUENTITEMSETGRAPHFig.
2,findsitemsC++andRUBYascandidaterecommendations.
TherecommendationscoresofitemAandCare1,correspondingtotheconfidencesoftherules,JAVA,ASP->C++andJAVA,ASP->RUBY,respectively.
Aproblemwithusingasingleglobalminimumsupportthresholdinassociationruleminingisthatthediscoveredpatternswillnotinclude"rare"butimportantitemswhichmaynotoccurfrequentlyinthetransactiondata.
316MaratheDagaduMitharamC=C++J=JAVAA=ASPR=RUBYP=PHPFigure2:FrequentItemsetsCLUSTERANALYSISANDVISITORSSEGMENTATIONConceptandExampleClusteringofuserrecords(sessionsortransactions)isoneofthemostcommonlyusedanalysistasksinWebusageminingandWebanalytics.
Clusteringofuserstendstoestablishgroupsofusersexhibitingsimilarbrowsingpatterns.
Suchknowledgeisespeciallyusefulforinferringuserdemographicsinordertoperformmarketsegmentationine-commerceapplicationsorprovidepersonalizedWebcontenttotheuserswithsimilarinterests.
DiscoveryandAnalysisofWebUsageMining317HereweUsetheformulaof"WebDataMining"-Bingliubook.
Asanexample,considerthetransactiondatadepictedinsimplicityweassumethatfeature(pageview)weightsineachtransactionvectorarebinary(incontrasttoweightsbasedonafunctionofpageviewduration).
Weassumethatthedatahasalreadybeenclusteredusingastandardclusteringalgorithmsuchask-means,resultinginthreeclustersofusertransactions.
Itshowstheaggregateprofilecorrespondingtocluster1.
Asindicatedbythepageviewweights,pageviewsBandFarethemostsignificantpagescharacterizingthecommoninterestsofusersinthissegment.
PageviewC,however,onlyappearsinonetransactionandmightberemovedgivenafilteringthresholdgreaterthan0.
25.
Suchpatternsareusefulforcharacterizinguserorcustomersegments.
Thisexample,forinstance,indicatesthattheresultingusersegmentisclearlyinterestedinitemsBandFandtoalesserdegreeinitemA.
GivenanewuserwhoshowsinterestinitemsAandB,thispatternmaybeusedtoinferthattheusermightbelongtothissegmentand,therefore,wemightrecommenditemFtothatuser.
ExperimentandResultsInthisexperimentwedefinetable"weather"anddefinefields.
318MaratheDagaduMitharamOutputUsingClusterinWeka===Runinformation===Scheme:weka.
clusterers.
HierarchicalClusterer-N2-LSINGLE-P-A"weka.
core.
EuclideanDistance-Rfirst-last"Relation:weatherInstances:13Attributes:5outlooktemperaturehumiditywindyIgnoredplayTestmode:Classestoclustersevaluationontrainingdata===Modelandevaluationontrainingset===Cluster0((((((1.
0:0.
18505,1.
0:0.
18505):0.
05959,1.
0:0.
24464):0.
7557,(1.
0:0.
16832,(1.
0:0.
08235,1.
0:0.
08235):0.
08597):0.
83201):0.
00109,((0.
0:0.
22986,0.
0:0.
22986):0.
77157,0.
0:1.
00142):0):0.
00106,(0.
0:0.
21648,0.
0:0.
21648):0.
78601):0.
00135,1.
0:1.
00384)ClusteredInstances012(92%)11(8%)Classattribute:playClassestoClusters:01<--assignedtocluster71|yes50|noCluster0<--yesCluster1<--NoclassIncorrectlyclusteredinstances:6.
046.
1538%DiscoveryandAnalysisofWebUsageMining319VisualizationsofPatternsCONCLUSIONSUsagepatternsdiscoveredthroughWebusageminingareeffectiveincapturingitem-to-itemanduser-to-userrelationshipsandsimilaritiesatthelevelofusersessions.
Thispaperhasattemptedtoforthepurposeofwebusagemining.
TheproposedmethodsweresuccessfullytestedonthedatasetordatabasesusingassociationruleandclusteranalysismethodusingWekaTool.
Ourexperimentsconfirmedthatoneofthemajorissuesinassociationruleandclusterfindingistheexistenceoftoomanyrulesandgroups,allofwhichsatisfydefinedconstraints.
REFERENCES1.
Webdatamining–BingLiu320MaratheDagaduMitharam2.
PPTforWebusagemining-BingLiu3.
Srivastava,J.
,Cooley,R.
,Deshpande,M.
,Tan,P.
N.
(2000).
WebUsageMining:DiscoveryandApplicationsofUsagePatternsfromWebData.
ACMSIGKDD,Jan2000.
4.
JaideepSrivastavaPaper5.
WCA.
Webcharacterizationterminology&definitions.
6.
http://www.
w3.
org/1999/05/WCA-terms/.
Vigenteal19/11/2005

CYUN(29元/月)美国、香港、台湾、日本、韩国CN2,续费原价

关于CYUN商家在之前有介绍过一次,CYUN是香港蓝米数据有限公司旗下的云计算服务品牌,和蓝米云、蓝米主机等同属该公司。商家主要是为个人开发者用户、中小型、大型企业用户提供一站式核心网络云端部署服务,促使用户云端部署化简为零,轻松快捷运用云计算。目前,CYUN主要运营美国、香港、台湾、日本、韩国CN2线路产品,包括云服务器、站群服务器和独立服务器等。这次看到CYUN夏季优惠活动发布了,依然是熟悉的...

易探云美国云服务器评测,主机低至33元/月,336元/年

美国服务器哪家平台好?美国服务器无需备案,即开即用,上线快。美国服务器多数带防御,且有时候项目运营的时候,防御能力是用户考虑的重点,特别是网站容易受到攻击的行业。现在有那么多美国一年服务器,哪家的美国云服务器好呢?美国服务器用哪家好?这里推荐易探云,有美国BGP、美国CN2、美国高防、美国GIA等云服务器,线路优化的不错。易探云刚好就是做香港及美国云服务器的主要商家之一,我们来看一下易探云美国云服...

日本美国站群服务器raksmart站群新增,限量低至月1.99美元

RAKsmart 商家八月份的促销活动今天更新。基本上和上个月的产品套餐活动差不多的,不过也是有简单的微调。对于RAKsmart商家还是比较了解的,他们家产品虽然这两年增加多个机房,以及在VPS主机方案上有丰富的机房和调整到一些自营机房,他们家的策划能力还是有限,基本上每个月的套餐活动都差不多。RAKsmart 在八月份看到有新增香港高防服务器可选,最高100GB防御。同时原来上个月缺货的日本独立...

yw372:Com为你推荐
支持ipad深圳市残友电子善务股份有限公司phpweb破解怎样破解握手包flashfxp用Flashfxp上传网站的具体步骤filezillaserver怎么用FileZilla Server 0.9.27 绿色汉化版软件?申请支付宝账户怎么申请支付宝的账号?360防火墙在哪里360防火墙filezilla_server如何用FileZilla Server新增FTP帐号filezilla_serverFileZilla无法连接服务器怎么解决计算机cuteftp
域名停靠 重庆域名注册 日本vps net主机 hostgator 表格样式 抢票工具 512au 京东云擎 天猫双十一抢红包 网通服务器ip 服务器架设 租空间 铁通流量查询 165邮箱 双十一秒杀 东莞数据中心 广州服务器 vip域名 Updog 更多