DISCOVERY AND ANALYSIS OF WEB USAGE MINING

MARATHE DAGADU MITHARAM
R. C. Patel A. C. S. College, Shirpur, Maharashtra, India

ABSTRACT

In this paper we describe some of the most common types of pattern discovery and analysis techniques employed in Web usage mining. We focus on association analysis and cluster analysis.

Association rule mining is a fundamental data mining task. Its objective is to find all co-occurrence relationships, called associations, among data items. Let I = {i1, i2, …, im} be a set of items and let T = (t1, t2, …, tn) be a set of transactions, where each transaction is a set of items drawn from I.

Cluster analysis and visitors segmentation: clustering is a data mining technique that groups together a set of items having similar characteristics. In the usage domain, there are two kinds of interesting clusters that can be discovered: user clusters and page clusters.

Goal: discovery and analysis of web usage patterns using association analysis, and discovery and analysis of web usage patterns using cluster analysis and visitors segmentation.
KEYWORDS: Association Analysis, Cluster Analysis and Visitors Segmentation

INTRODUCTION

Association rule discovery and statistical correlation analysis can find groups of items or pages that are commonly accessed or purchased together. The association analysis in this paper is based on the Apriori algorithm. This algorithm finds groups of items using support and confidence, keeping only the groups that satisfy a user-specified minimum support threshold. Such groups of items are referred to as frequent itemsets, and they can be arranged in a frequent itemset graph.
Log files generated by web servers contain enormous amounts of web usage data that is potentially valuable for understanding the behavior of website visitors.

Clustering of user records (sessions or transactions) is one of the most commonly used analysis tasks in Web usage mining and Web analytics. Clustering of users tends to establish groups of users exhibiting similar browsing patterns. Such knowledge is especially useful for inferring user demographics in order to perform market segmentation in e-commerce applications or provide personalized Web content to the users with similar interests. Further analysis of user groups based on their demographic attributes (e.g., age, gender, income level, etc.) may lead to the discovery of valuable business intelligence. Usage-based clustering has also been used to create Web-based "user communities" reflecting similar interests of groups of users, and to learn user models that can be used to provide dynamic recommendations in Web personalization applications.
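International Journal of Computer Science Engineering and Information Technology Research (IJCSEITR), ISSN 2249-6831, Vol. 3, Issue 1, Mar 2013, 313-320, TJPRC Pvt. Ltd.

ASSOCIATION RULE

Support & Confidence

The support of a rule X → Y is the percentage of transactions in T that contain X ∪ Y, where n is the number of transactions in T. Support is a useful measure of how often an itemset occurs. The confidence of the rule is the percentage of transactions containing X that also contain Y: if X holds in a transaction we check for Y, and if X does not hold the rule says nothing about Y. Writing (X ∪ Y).count for the number of transactions that contain X ∪ Y, and X.count for the number that contain X, we have, e.g.

$$\text{support}(X \rightarrow Y) = \frac{(X \cup Y).\text{count}}{n}$$

$$\text{confidence}(X \rightarrow Y) = \frac{(X \cup Y).\text{count}}{X.\text{count}}$$

Using these formulas, a rule is accepted when it meets the minimum support (minsup) and minimum confidence (minconf) thresholds. Support and confidence are calculated as follows.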
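    T1   C++, JAVA, RUBY
    T2   C++, ASP
    T3   ASP, VB
    T4   C++, JAVA, ASP
    T5   C++, JAVA, PHP, ASP, RUBY
    T6   JAVA, PHP, RUBY
    T7   JAVA, RUBY, PHP

    JAVA, PHP → RUBY [sup = 3/7, conf = 3/3]

Of the 7 transactions above, 3 contain JAVA, PHP and RUBY together, so the support is 3/7. All 3 transactions that contain JAVA and PHP also contain RUBY, so the confidence is 3/3.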
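As a minimal sketch of this calculation, the following Python snippet recomputes the support and confidence of JAVA, PHP → RUBY from the seven transactions. The helper names `count`, `support` and `confidence` are illustrative and do not come from the paper.

```python
# The seven example transactions T1..T7.
transactions = [
    {"C++", "JAVA", "RUBY"},                # T1
    {"C++", "ASP"},                         # T2
    {"ASP", "VB"},                          # T3
    {"C++", "JAVA", "ASP"},                 # T4
    {"C++", "JAVA", "PHP", "ASP", "RUBY"},  # T5
    {"JAVA", "PHP", "RUBY"},                # T6
    {"JAVA", "RUBY", "PHP"},                # T7
]


def count(itemset, transactions):
    """Number of transactions that contain every item of `itemset`."""
    itemset = set(itemset)
    return sum(1 for t in transactions if itemset <= t)


def support(x, y, transactions):
    """support(X -> Y) = (X u Y).count / n"""
    return count(set(x) | set(y), transactions) / len(transactions)


def confidence(x, y, transactions):
    """confidence(X -> Y) = (X u Y).count / X.count"""
    return count(set(x) | set(y), transactions) / count(x, transactions)


sup = support({"JAVA", "PHP"}, {"RUBY"}, transactions)
conf = confidence({"JAVA", "PHP"}, {"RUBY"}, transactions)
print(f"sup = {sup:.3f}, conf = {conf:.3f}")  # sup = 0.429, conf = 1.000
```

A rule would then be kept only if `sup` and `conf` are at least the chosen minsup and minconf.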
Candidate itemsets of every size are generated and checked using the joining and pruning steps.
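The joining and pruning steps can be sketched as a level-wise search. This is a simplified illustration rather than the paper's implementation: the join takes unions of frequent (k-1)-itemsets instead of the prefix-based join of the classic Apriori algorithm, the function name `apriori` and the parameter `min_count` are assumptions, and the demo uses a minimum support count of 2 so that the pruning is visible.

```python
from itertools import combinations

transactions = [
    {"C++", "JAVA", "RUBY"},                # T1
    {"C++", "ASP"},                         # T2
    {"ASP", "VB"},                          # T3
    {"C++", "JAVA", "ASP"},                 # T4
    {"C++", "JAVA", "PHP", "ASP", "RUBY"},  # T5
    {"JAVA", "PHP", "RUBY"},                # T6
    {"JAVA", "RUBY", "PHP"},                # T7
]


def apriori(transactions, min_count):
    """Return {itemset: support count} for itemsets with count >= min_count."""

    def count(itemset):
        return sum(1 for t in transactions if itemset <= t)

    # Level 1: frequent single items.
    items = sorted({i for t in transactions for i in t})
    level = {frozenset([i]): count(frozenset([i])) for i in items}
    level = {s: c for s, c in level.items() if c >= min_count}
    frequent = dict(level)

    k = 2
    while level:
        prev = set(level)
        # Join step: unions of two frequent (k-1)-itemsets that have size k.
        candidates = {a | b for a in prev for b in prev if len(a | b) == k}
        # Prune step: every (k-1)-subset of a candidate must itself be frequent.
        candidates = {
            c for c in candidates
            if all(frozenset(s) in prev for s in combinations(c, k - 1))
        }
        # Count the surviving candidates and keep the frequent ones.
        level = {c: count(c) for c in candidates}
        level = {c: n for c, n in level.items() if n >= min_count}
        frequent.update(level)
        k += 1
    return frequent


frequent = apriori(transactions, min_count=2)
for itemset in sorted(frequent, key=lambda s: (len(s), sorted(s))):
    print(sorted(itemset), frequent[itemset])
```

Pruning before counting is what saves work: a candidate such as {C++, RUBY, ASP} is discarded without scanning the transactions because its subset {RUBY, ASP} is not frequent at this threshold.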
In web usage mining, such rules can be used to optimize the structure of a website, e.g.

Language, /product/software RCPACS COLLEGE Website

EXPERIMENT - FINDING WEB USAGE ASSOCIATION RULES

    Instances:    14
    Attributes:   5
                  outlook
                  temperature
                  humidity
                  windy
                  play

    If check sunny, false → yes [sup = 1/14, conf = 1/1]

The purpose of this experiment was to give some insight into the usefulness of association rules when they are applied to the web log dataset of an educational institution and others. We expected to find rules that correlate to web pages that contain information about sunny, rainy or temperature, etc.
Suppose this is the transaction table and we want to find the frequent itemsets:

    T1   C++, JAVA, RUBY
    T2   C++, ASP
    T3   ASP, VB
    T4   C++, JAVA, ASP
    T5   C++, JAVA, PHP, ASP, RUBY
    T6   JAVA, PHP, RUBY
    T7   JAVA, RUBY, PHP

    Size 1           Size 2              Size 3                   Size 4
    Itemset  Supp.   Itemset     Supp.   Itemset          Supp.   Itemset               Supp.
    C++      4       C++, JAVA   3       C++, JAVA, RUBY  2       C++, JAVA, RUBY, ASP  1
    JAVA     5       C++, RUBY   2       C++, JAVA, ASP   2       C++, JAVA, RUBY, PHP  1
    RUBY     4       C++, ASP    3       JAVA, RUBY, ASP  1
    ASP      4       C++, PHP    1       JAVA, RUBY, PHP  3
    VB       1       JAVA, RUBY  4       RUBY, ASP, PHP   1
    PHP      3       JAVA, ASP   2
                     JAVA, PHP   3
                     RUBY, ASP   1
                     RUBY, PHP   3
                     ASP, PHP    1

Figure 1: Web Transactions and Resulting Frequent Itemsets (Minsup = 1)

The frequent itemsets are found by using the joining and pruning methods of association rule mining.

FREQUENT ITEMSET GRAPH

For an active session containing JAVA and ASP, the frequent itemset graph in Fig. 2 finds items C++ and RUBY as candidate recommendations. The recommendation scores of C++ and RUBY are 1 and 0.5, corresponding to the confidences of the rules JAVA, ASP → C++ (2/2) and JAVA, ASP → RUBY (1/2), respectively.
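A minimal sketch of how these recommendation scores follow from the frequent itemsets, assuming an active session of {JAVA, ASP} and using only the support counts listed in Figure 1. The function name `recommend` and the `min_conf` parameter are illustrative. Computed this way, the confidence of JAVA, ASP → C++ is 2/2 = 1 and that of JAVA, ASP → RUBY is 1/2 = 0.5.

```python
# Support counts copied from the size-2 and size-3 columns of Figure 1.
support_count = {
    frozenset({"JAVA", "ASP"}): 2,
    frozenset({"C++", "JAVA", "RUBY"}): 2,
    frozenset({"C++", "JAVA", "ASP"}): 2,
    frozenset({"JAVA", "RUBY", "ASP"}): 1,
    frozenset({"JAVA", "RUBY", "PHP"}): 3,
    frozenset({"RUBY", "ASP", "PHP"}): 1,
}


def recommend(session, support_count, min_conf=0.0):
    """Score each item that extends the session to a listed frequent itemset.

    The score of item i is the confidence of the rule session -> i,
    i.e. count(session u {i}) / count(session).
    """
    session = frozenset(session)
    base = support_count[session]
    scores = {}
    for itemset, cnt in support_count.items():
        # Children of the session node in the frequent itemset graph.
        if len(itemset) == len(session) + 1 and session < itemset:
            (item,) = itemset - session
            conf = cnt / base
            if conf >= min_conf:
                scores[item] = conf
    return scores


scores = recommend({"JAVA", "ASP"}, support_count)
print(sorted(scores.items()))  # [('C++', 1.0), ('RUBY', 0.5)]
```

Raising `min_conf` above 0.5 would leave C++ as the only recommendation.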
A problem with using a single global minimum support threshold in association rule mining is that the discovered patterns will not include "rare" but important items which may not occur frequently in the transaction data.
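The effect is visible even on the small example. The sketch below uses the size-1 support counts from Figure 1; the function name `frequent_items` is illustrative. Raising the global threshold from a count of 1 to a count of 2 removes VB from the results entirely, whether or not that item matters to the site owner.

```python
# Size-1 support counts from Figure 1.
item_support = {"C++": 4, "JAVA": 5, "RUBY": 4, "ASP": 4, "VB": 1, "PHP": 3}


def frequent_items(item_support, min_count):
    """Items whose support count meets a single global threshold."""
    return {item for item, c in item_support.items() if c >= min_count}


# Items that disappear when the threshold goes from 1 to 2.
lost = frequent_items(item_support, 1) - frequent_items(item_support, 2)
print(lost)  # {'VB'}
```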
C = C++, J = JAVA, A = ASP, R = RUBY, P = PHP

Figure 2: Frequent Itemsets

CLUSTER ANALYSIS AND VISITORS SEGMENTATION

Concept and Example

Clustering of user records (sessions or transactions) is one of the most commonly used analysis tasks in Web usage mining and Web analytics. Clustering of users tends to establish groups of users exhibiting similar browsing patterns. Such knowledge is especially useful for inferring user demographics in order to perform market segmentation in e-commerce applications or provide personalized Web content to the users with similar interests.

Here we use the formulation from Bing Liu's book "Web Data Mining".
As an example, consider the transaction data depicted in the accompanying figure. For simplicity, we assume that feature (pageview) weights in each transaction vector are binary (in contrast to weights based on a function of pageview duration). We assume that the data has already been clustered using a standard clustering algorithm such as k-means, resulting in three clusters of user transactions. The figure also shows the aggregate profile corresponding to cluster 1. As indicated by the pageview weights, pageviews B and F are the most significant pages characterizing the common interests of users in this segment. Pageview C, however, only appears in one transaction and might be removed given a filtering threshold greater than 0.25.

Such patterns are useful for characterizing user or customer segments. This example, for instance, indicates that the resulting user segment is clearly interested in items B and F and to a lesser degree in item A. Given a new user who shows interest in items A and B, this pattern may be used to infer that the user might belong to this segment and, therefore, we might recommend item F to that user.
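The aggregate profile and the resulting recommendation can be sketched as follows. The four binary transactions in `cluster1` are hypothetical: they are chosen only to match the description above (B and F in every transaction, A in most, C in exactly one) and are not the data from the original figure. The function names and the threshold value 0.3 are likewise assumptions.

```python
pages = ["A", "B", "C", "D", "E", "F"]

# Hypothetical binary transactions assigned to cluster 1.
cluster1 = [
    {"A", "B", "F"},
    {"B", "C", "F"},
    {"A", "B", "F"},
    {"A", "B", "F"},
]


def aggregate_profile(cluster, pages, threshold):
    """Mean weight of each pageview in the cluster, keeping weights >= threshold."""
    profile = {}
    for page in pages:
        weight = sum(1 for t in cluster if page in t) / len(cluster)
        if weight >= threshold:
            profile[page] = weight
    return profile


def recommend(profile, visited):
    """Profile pages the user has not visited yet, highest weight first."""
    unseen = [(p, w) for p, w in profile.items() if p not in visited]
    return sorted(unseen, key=lambda pw: -pw[1])


# C has weight 0.25 and is filtered out by a threshold greater than 0.25.
profile = aggregate_profile(cluster1, pages, threshold=0.3)
print(profile)                         # {'A': 0.75, 'B': 1.0, 'F': 1.0}
print(recommend(profile, {"A", "B"}))  # [('F', 1.0)]
```

With real-valued weights (for example, based on pageview duration) the same averaging applies; only the construction of the transaction vectors changes.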
Experiment and Results

In this experiment we define a table named "weather" and define its fields.
Output Using Cluster in Weka

    === Run information ===

    Scheme:       weka.clusterers.HierarchicalClusterer -N 2 -L SINGLE -P -A "weka.core.EuclideanDistance -R first-last"
    Relation:     weather
    Instances:    13
    Attributes:   5
                  outlook
                  temperature
                  humidity
                  windy
    Ignored:
                  play
    Test mode:    Classes to clusters evaluation on training data

    === Model and evaluation on training set ===

    Cluster 0
    ((((((1.0:0.18505,1.0:0.18505):0.05959,1.0:0.24464):0.7557,(1.0:0.16832,(1.0:0.08235,1.0:0.08235):0.08597):0.83201):0.00109,((0.0:0.22986,0.0:0.22986):0.77157,0.0:1.00142):0):0.00106,(0.0:0.21648,0.0:0.21648):0.78601):0.00135,1.0:1.00384)

    Clustered Instances

    0      12 ( 92%)
    1       1 (  8%)

    Class attribute: play
    Classes to Clusters:

     0  1  <-- assigned to cluster
     7  1 | yes
     5  0 | no

    Cluster 0 <-- yes
    Cluster 1 <-- No class

    Incorrectly clustered instances :   6.0   46.1538 %

Visualizations of Patterns

CONCLUSIONS

Usage patterns discovered through Web usage mining are effective in capturing item-to-item and user-to-user relationships and similarities at the level of user sessions.
This paper has attempted to apply these techniques for the purpose of web usage mining. The proposed methods were successfully tested on the datasets using the association rule and cluster analysis methods of the Weka tool. Our experiments confirmed that one of the major issues in association rule and cluster finding is the existence of too many rules and groups, all of which satisfy the defined constraints.
REFERENCES

1. Web Data Mining - Bing Liu
2. PPT for Web Usage Mining - Bing Liu
3. Srivastava, J., Cooley, R., Deshpande, M., Tan, P. N. (2000). Web Usage Mining: Discovery and Applications of Usage Patterns from Web Data. ACM SIGKDD, Jan 2000.
4. Jaideep Srivastava Paper
5. WCA. Web characterization terminology & definitions.
6. http://www.w3.org/1999/05/WCA-terms/. Valid as of 19/11/2005
