Index

A
Accents, LaTeX, 319
Advanced data aggregation
  apply() functions, 162–165
  merge(), 163
  transform() function, 162–163
Anaconda
  packages, 65
  types, 64
Array manipulation
  joining arrays
    column_stack(), 52
    hstack() function, 51
    row_stack(), 52
    vstack() function, 51
  splitting arrays
    hsplit() function, 52
    split() function, 53–54
    vsplit() function, 52
Artificial intelligence, 3

B
Basic operations
  aggregate functions, 44
  arithmetic operators, 41–42
  decrement operators, 43–44
  increment operators, 43–44
  matrix product, 42–43
  universal functions (ufunc), 44
Bayesian methods, 3

C
Choropleth maps
  D3 library, 300
  geographical representations, 300
  HTML() function, 302
  Jinja2, 302–303
  JSON and TSV, 303
  JSON TopoJSON, 300–301
  require.config(), 301
  US population, 2014
    data source census.gov, 306
    file TSV, codes, 305
    Jinja2.Template, 307–308
    pop2014_by_county dataframe, 305
    population.csv, 306–307
    render() function, 308–309
    SUMLEV values, 304
Classification models, 8
Climatic data, 329
Clustered bar chart
  IPython Notebook, 296–297
  Jinja2, 297, 299
  render() function, 299–300
Clustering models, 3, 7–8
Combining, 139–140
Concatenating, 136–139
Conditions and Boolean Arrays, 50
Correlation, 94–95
Covariance, 94–95
Cross-validation, 8

D
Data aggregation
  groupby, 157–158
  hierarchical grouping, 159
  price1 column, 158
  split-apply-combine, 157
Data analysis
  data visualization, 1
  definition, 1
  deployment phase, 2
  information, 4
  knowledge domains
    artificial intelligence, 3
    computer science, 2–3
    fields of application, 3–4
    machine learning, 3
    mathematics and statistics, 3
  open data, 10–11
  predictive model, 1
  problems of, 2
  process
    data exploration/visualization, 7
    data extraction, 6
    data preparation, 7
    deployment, 8
    model validation, 8
    predictive modeling, 8
    problem definition, 5
    stages, 5
  purpose of, 1
  Python and, 11
  quantitative and qualitative, 9–10
  types
    categorical data, 4
    numerical data, 4
DataFrame
  definition, 75–76
  nested dict, 81
  structure, 75
  transposition, 81–82
Data preparation
  DataFrame, 132
  pandas.concat(), 132
  pandas.DataFrame.combine_first(), 132
  pandas.merge(), 132
Data structures, operations
  DataFrame, 88–89
  flexible arithmetic methods, 88
Data transformation
  drop_duplicates() function, 144
  removing duplicates, 143–144
Data visualization
  3D surfaces, 227, 229
  adding text
    axis labels, 184
    informative label, 187
    mathematical expression, 187–188
    modified, 185
  bar chart
    error bars, 210
    horizontal, 210–211
    matplotlib, 207
    multiseries stacked bar, 215–217
    pandas DataFrame, 213–214
    xticks() function, 208
  bar chart 3D, 230–231
  chart typology, 198
  contour plot, 223–225
  data analysis, 167
  display subplots, 231, 233
  grid, 188–189, 233, 235
  handling date values, 196–198
  histogram, 206–207
  HTML file, 193–195
  image file, 195
  installation, 168
  IPython QtConsole, 168, 170
  kwargs
    horizontal subplots, 183
    linewidth, 182
    vertical subplots, 183–184
  legend, 189–191
  line chart
    annotate(), 204
    arrowprops kwarg, 204
    Cartesian axes, 203
    color codes, 200–201
    colors and line styles, 200–201
    data points, 198
    gca() function, 203
    Greek characters, 202
    LaTeX expression, 204
    mathematical expressions, 199, 205
    pandas, 205–206
    set_position() function, 203
    three different series, 199–200
    xticks() functions, 201
    yticks() functions, 201
  matplotlib architecture
    and NumPy, 179–181
    artist layer, 171–172
    backend layer, 170
    functions and tools, 170
    Line2D object, 174
    plotting window, 174
    plt.plot() function, 177
    properties, plot, 177, 179
    pylab and pyplot, 172–173
    Python programming language, 173
    QtConsole, 175–176
    scripting layer, 172
  matplotlib Library, 167–168
  mplot3d, 227
  pie charts, 219–221, 223
  polar chart, 225–227
  saving, code, 192–193
  scatter plot, 3D, 229
Decision trees, 7
Detecting and filtering outliers
  any() function, 151
  describe() function, 151
  std() function, 151
Digits dataset
  definition, 312
  digits.images array, 314
  digit.targets array, 314
  handwritten digits, 314
  handwritten number images, 312
  matplotlib library, 314
  scikit-learn library, 313
Discretization
  categorical type, 148
  cut() function, 148–151
  qcut(), 150–151
  value_counts() function, 149
Django, 11
Dropping, 85–86

E
Eclipse (pyDev), 30

F
Financial data, 329
Flexible arithmetic methods, 88
Fonts, LaTeX, 319
Functionalities, indexes
  arithmetic and data alignment, 86–87
  dropping, 85–86
  reindexing, 83–85
Function application and mapping
  element, 89–90
  row/column, 90–91
  statistics, 91

G
Group iteration
  chain of transformations, 160–161
  functions on groups
    mark() function, 161–162
    quantiles() function, 161
  groupby object, 160

H
Handwriting recognition
  digits dataset, 312–314
  digits with scikit-learn, 311–312
  handwritten digits, matplotlib library, 315
  learning and predicting, 315–316
  OCR software, 311
  svc estimator, 316
  validation set, six digits, 315–316
Health data, 328
Hierarchical indexing
  arrays, 99
  DataFrame, 98
  reordering and sorting levels, 100
  stack() function, 99
  structure, 98
  summary statistics, 100
  two-dimensional structure, 97

I
IDEs. See Interactive development environments (IDEs)
IDLE. See Integrated development environment (IDLE)
Integrated development environment (IDLE), 29
Interactive development environments (IDEs)
  Eclipse (pyDev), 30
  IDLE, 29
  Komodo, 32
  Liclipse, 31–32
  NinjaIDE, 32
  Spyder, 29
  Sublime, 30–31
Interactive programming language, 14
Interfaced programming language, 14
Interpreted programming language, 13
Interpreter
  characterization, 14
  Cython, 15
  Jython, 15
  PVM, 14
  PyPy, 15
IPython
  Jupyter project, 27
  Notebook, 26–27
  Qt-Console, 26
  shell, 24–25
IPython Notebook, 312
  CSV files, 274–275
  DataFrames, 272–273
  humidity, 282–283
  JSON structure, 270–271
  matplotlib library, 275
  pandas library, 271
  read_json() function, 270
  SVR method, 278–279
  temperature, 275–278, 281
Iris flower dataset
  Anderson Iris Dataset, 238
  IPython QtConsole, 239
  Iris setosa features, 241
  length and width, petal, 241–242
  matplotlib library, 240
  target attribute, 240
  types of analysis, 239
  variables, 241

J
JavaScript D3 Library
  bar chart, 296
  CSS definitions, 293–294
  data-driven documents, 293
  HTML importing library, 293
  IPython Notebooks, 293
  Jinja2 library, 294–295
  Pandas dataframe, 296
  render() function, 296
  require.config(), 293
  web charts, creation, 293
Jinja2 library, 294–295
Join operations, 132
Jupyter project, 27

K
K-nearest neighbors classification
  2D scatterplot, sepals, 245
  decision boundaries, 246–247
  predict() function, 244
  random.permutation(), 244
  training and testing set, 244

L
LaTeX
  accents, 319
  fonts, 319
  fractions, binomials, and stacked numbers, 318
  radicals, 318
  subscripts and superscripts, 318
  symbols
    arrow symbols, 319, 324–325
    big symbols, 321
    binary operation and relation symbols, 321, 323
    delimiters, 320
    hebrew, 320
    lowercase Greek, 320
    miscellaneous symbols, 319
    standard function names, 321
    uppercase Greek, 320
  with IPython Notebook
    in markdown cell, 317
    in Python 2 cell, 317
  with matplotlib, 317
Liclipse, 31–32
Linear regression, 8
Linux distribution, 65
Loading and writing data
  dataframe, 127
  pgAdmin III, 127
  postgreSQL, 126
  read_sql() function, 125
  read_sql_query() function, 128
  read_sql_table() function, 128
  sqlite3, 124
LOD cloud diagram, 10
Logistic regression, 8

M
Machine learning, 3
  development of algorithms, 237
  diabetes dataset, 247–248
  features/attributes, 237
  learning problem, 237
  linear regression
    coef_ attribute, 249
    linear correlation, 250
    parameters, 248
    physiological factors, 251–252
    progression of diabetes, 251–252
  supervised learning, 237–238
  training and testing set, 238
  unsupervised learning, 238
Mapping
  adding Values, 145–146
  inplace option, 147
  rename() function, 147
  renaming, axes, 146–147
  replacing Values, 144–145
Matlab, 11
Merging
  DataFrame, 132–133
  join() function, 135–136
  left_on and right_on, 134–135
  merge(), 132–133
Meteorological data
  Adriatic Sea, 266–267
  climate, 265
  Comacchio, 268
  data source
    JSON file, 269
    weather map, 269
  IPython Notebook, 270
  mountainous areas, 265
  wind speed, 287–288
Microsoft excel files
  data.xls, 116–117
  internal module xlrd, 116
  read_excel() function, 116
Musical data, 330

N
Ndarray
  array() function, 36–38
  data, types, 38
  dtype Option, 39
  intrinsic creation, 39–40
  type() function, 36–37
Not a Number (NaN) data
  filling, NaN occurrences, 97
  filtering out NaN values, 96–97
  NaN value, 96
NumPy library
  array, Iterating, 48–49
  broadcasting
    compatibility, 56
    complex cases, 57
    operator/function, 55
  BSD, 35
  copies/views of objects, 54–55
  data analysis, 35
  indexing, 33, 45–46
  ndarray, 36
  Numarray, 35
  python language, 35
  slicing, 46–48
  vectorization, 55

O
Object-oriented programming language, 14
OCR software. See Optical character recognition (OCR) software
Open data sources, 10, 11
  climatic data, 329–330
  financial data, 329
  for demographics
    IPython Notebook, 290
    Pandas dataframes, 290
    pop2014_by_state dataframe, 291
    pop2014 dataframe, 290–291
    United States Census Bureau, 289
    with matplotlib, 292
  health data, 328
  miscellaneous and public data sets, 329
  musical data, 330
  political and government data, 327–328
  publications, newspapers, and books, 330
  social data, 328
  sports data, 330
Open-source programming language, 14
Optical character recognition (OCR) software, 311
Order() function, 93

P
Pandas dataframes, 290, 296
Pandas data structures
  assigning values, 70, 78–79
  DataFrame, 75–76
  declaring series, 68–69
  deleting column, 80
  dictionaries, series, 74
  duplicate labels, 82–83
  evaluating values, 72
  filtering values, 71, 80
  internal elements, selection, 69
  mathematical functions, 71
  membership value, 80
  NaN values, 73
  NumPy arrays and existing series, 70–71
  operations, 71, 74
  selecting elements, 77–78
  series, 68
Pandas library
  correlation and covariance, 94–95
  data structures. (see Pandas data structures)
  data structures, operations, 87–89
  functionalities. (see Functionalities, indexes)
  function application and mapping, 89–91
  getting started, 67
  hierarchical indexing and leveling, 97–101
  installation
    Anaconda, 64–65
    development phases, 67
    Linux, 65
    module repository, windows, 66
    PyPI, 65
    source, 66
  Not a Number (NaN) data, 95–97
  python data analysis, 63–64
  sorting and ranking, 91–94
Permutation
  new_order array, 152
  numpy.random.permutation() function, 152
  random sampling
    DataFrame, 152
    np.random.randint() function, 152
    take() function, 152
Pickle - python object
  frame.pkl, 123
  pandas library, 123
Pivoting
  hierarchical indexing, 140–141
  long to wide format, 141–142
  stack() function, 140
  unstack() function, 140
Political and government data, 327–328
Pop2014_by_county dataframe, 305
Pop2014_by_state dataframe, 291–292
Pop2014 dataframe, 290–291
Portable programming language, 13
Principal component analysis (PCA), 242–243
Public data sets, 329
PVM. See Python virtual machine (PVM)
PyPI. See Python package index (PyPI)
PyPy interpreter, 15
Python, 11
Python data analysis library, 63–64
Python module, 67
Python package index (PyPI), 28
Python's world
  distributions
    Anaconda, 16–17
    Enthought Canopy, 17
    Python(x,y), 18
  IDEs. (see Interactive development environments (IDEs))
  implementation, code, 19
  installation, 16
  interact, 19
  interpreter, 14–15
  IPython, 24–27
  programming language, 13–14
  PyPI, 28
  Python 2, 15
  Python 3, 15
  run, entire program code, 18–19
  SciPy, 32–34
  shell, 18
  writing python code
    data structure, 21–22
    functional programming, 23
    indentation, 24
    libraries and functions, 20–21
    mathematical operations, 20
Python virtual machine (PVM), 14

Q
Qualitative analysis, 9, 10
Quantitative analysis, 9, 10

R
R, 11
Radicals, LaTeX, 318
Ranking, 93–94
Reading and writing array
  binary files, 59–60
  tabular data, 60–61
Reading and writing data
  books.json, 119
  create_engine() function, 124
  CSV and textual files
    extension .txt, 104
    header option, 105
    index_col option, 106
    myCSV_01.csv, 104
    myCSV_03.csv, 106
    names option, 105
    read_csv() function, 104, 106
    read_table() function, 105
  DataFrame objects, 103
  frame.json, 119
  functionalities, 103
  HDF5 library, 121
  HDFStore, 121
  HTML files
    data structures, 111
    myFrame.html, 112
    read_html(), 113
    to_html() function, 111–112
    web_frames, 113
    web pages, 111
  I/O API tools, 103–104
  JSON data
    JSONViewer, 118
    read_json() and to_json(), 118
  json_normalize() function, 120
  mydata.h5, 121
  normalization, 119
  NoSQL databases
    insert() function, 129
    MongoDB, 128–130
  pandas.io.sql module, 124
  pickle - python object
    cPickle, 122–123
    stream of bytes, 122
  PyTables and h5py, 121
  read_json() function, 120
  sqlalchemy, 124
  TXT file, 106–108
  using regexp
    metacharacters, 107
    read_table(), 106
    skiprows, 108
Reading Data from XML
  books.xml, 114
  getchildren(), 115
  getroot() function, 115
  lxml.etree tree structure, 115
  lxml library, 114
  objectify, 114
  parse() function, 115
  tag attribute, 115
  text attribute, 115
Reading TXT files
  nrows and skiprows options, 108
  portion by portion, 108
Regression models, 3, 8
Reindexing, 83–85
Removing, 142
RoseWind
  DataFrame, 284
  hist array, 285
  polar chart, 285–287
  showRoseWind() function, 285, 287

S, T
Scikit-learn
  PCA, 242–243
  Python module, 237
Scikit-learn library, 311
  data analysis, 311
  sklearn.svm.SVC, 312
  svm module, 312
SciPy
  matplotlib, 34
  NumPy, 33
  Pandas, 33
Shape manipulation
  reshape() function, 50
  shape attribute, 51
  transpose() function, 51
Social data, 328
Sort_index() function, 91, 93
Sortlevel() function, 100
Sports data, 330
Stack() function, 99
String manipulation
  built-in methods
    count() function, 154
    error message, 154
    index() and find(), 154
    join() function, 154
    replace() function, 154
    split() function, 153
    strip() function, 153
  regular expressions
    findall() function, 155–156
    match() function, 156
    re.compile() function, 155
    regex, 155
    re.split() function, 155
    split() function, 155
Structured arrays
  dtype option, 58–59
  structs/records, 58
Subscripts and superscripts, LaTeX, 318
Support vector classification (SVC)
  effect, decision boundary, 256–257
  nonlinear, 257–259
  number of points, C parameter, 256
  predict() function, 255
  support_vectors array, 255
  training set, decision space, 253–254
  two portions, 255
Support vector classification (SVC), 312
Support vector machines (SVMs)
  decisional space, 253
  decision boundary, 253
  Iris Dataset
    decision boundaries, 259
    linear decision boundaries, 259–260
    polynomial decision boundaries, 261
    polynomial kernel, 260–261
    RBF kernel, 261
    training set, 259
  SVC. (see Support vector classification (SVC))
  SVR. (see Support vector regression (SVR))
Support vector regression (SVR)
  curves, 263
  diabetes dataset, 262
  linear predictive model, 262
  test set, data, 262
Swaplevel() function, 100

U, V
United States Census Bureau, 289–290
Urllib2 library, 290

W, X, Y, Z
Web Scraping, 2, 6
Writing data
  na_rep option, 110
  to_csv() function, 109–110