SystemRequirementsBitfusionGuideWHITEPAPER–OCTOBER2019WHITEPAPER|2VMwareBitfusion:SystemRequirementsTableofContentsSystemRequirements3HardwareRequirements3Networking4VerifyingSystemHealth5SoftwareDependencies6Ubuntu6CentOSandRedHat7ListCUDALibraries.
7VerifytheCUDAInstallation7CUDAInstallationonCPUNodes/VMs/Containers7WHITEPAPER|3SystemRequirementsFlexDirectcanbeinstalledfordifferentmodesofoperation.
Somemodesinclude,inwholeorinpart,theabilitiesofothermodes.
Thisonedocumentliststherequirementsforeachmode.
FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGERUbuntuLTS16.
04CentOS7orRHEL7.
4+NVIDIAToolkitCUDA7.
5CUDA8CUDA9CUDA10UbuntuLTS16.
04CentOS7orRHEL7.
4+UbuntuLTS16.
04CentOS7orRHEL7.
4+UbuntuLTS16.
04CentOS7orRHEL7.
4+NVIDIAGPUdriverversion372orhigherNVIDIAGPUdriverversion372orhigherNVIDIAGPUdriverversion372orhigherNote:CanalsorunApplicationsunderFlexDirectClient,inwhichcaseNVIDIAToolkitprerequisitesareneeded.
Note:CanalsorunApplicationsunderFlexDirectClient,inwhichcaseNVIDIAToolkitprerequisitesareneeded.
FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGERN/AAnygenerationCUDA-enabledNVIDIAGPU(s)AnygenerationCUDA-enabledNVIDIAGPU(s)AnygenerationCUDA-enabledNVIDIAGPU(s)10GbsormoreEthernet(TCP/IP)RoCEorInfinibandNetworkConnectivity10GbsormoreEthernet(TCP/IP)RoCEorInfiniband10GbsormoreEthernet(TCP/IP)RoCEorInfinibandSystemmemoryontheclientCPUmachinesshouldbe1.
5xthetotalaggregateGPUmemoryacrossalltheGPUsinthelargestGPUmachine.
N/AN/AN/AHardwareRequirementsWHITEPAPER|4NetworkingFlexDirectmakesuseofseveralportsorrangesofportsforprocesstoprocesscommunication.
Pleaseensurethatyourfirewallsdonotblocktheportsbelow.
Pleaseensuretherearenoportconflictswithotherapplications.
Someoftherangescanbemodifiedwithcommand-linearguments.
FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGER56001ServerCommunicationOutboundto:–Servernodes–Managernodes56001ServerCommunicationInboundfrom:–Analyticsnodes–ManagernodesOutboundto:–Analyticsnodes–Managernodes56001ServerCommunicationInboundfrom:–Clientnodes56001ServerCommunicationInboundfrom:–Clientnodes–Analyticsnodes–ManagernodesOutboundto:–Analyticsnodes–ManagernodesNote:Overridewith--srs_port56008,forexample.
Note:Overridewith--srs_port56008,forexample.
Note:Overridewith--srs_port56008,forexample.
55001-55100DispatcherCommunicationOutboundto:–Servernodes–Managernodes55001-55100DispatcherCommunicationN/A55001-55100DispatcherCommunicationInboundfrom:–Clientnodes55001-55100DispatcherCommunicationInboundfrom:–Clientnodes45201-46225CUDACommunicationOutboundto:–Servernodes–Managernodes45201-46225CUDACommunicationN/A45201-46225CUDACommunicationInboundfrom:–Clientnodes45201-46225CUDACommunicationInboundfrom:–Clientnodes54000WebServerN/A54000WebServerInboundfrom:–Anyonewithabrowser54000WebServerN/A54000WebServerInboundfrom:–AnyonewithabrowserNote:Overridewith--web_port12345,forexample.
Note:Overridewith--web_port12345,forexample.
WHITEPAPER|5FLEXDIRECTCLIENTSFLEXDIRECTANALYTICSFLEXDIRECTSERVERFLEXDIRECTMANAGER54001StatisticsCollectionN/A54001StatisticsCollectionInbound/Outbound–localhostonly54001StatisticsCollectionN/A54001StatisticsCollectionInbound/Outbound–localhostonlyNote:Overridewith--collection_port54321,forexample.
Note:Overridewith--collection_port54321,forexample.
Ensureyourmachineshaveingress/egressinternetaccessforaccesstodownloads,licensing,etc.
Checkforothernetworkingpolicies,suchasoutboundproxyorinternalDNS,staticversusdynamicIPs.
ConsultwiththeBitfusionteamifyouareusingdynamicIPsorhaveanyproxiessetup.
VerifyingSystemHealthOnceyouhaveinstalledFlexDirect,youcanvalidateyourenvironmentforthebestresultsbyrunningtheFlexDirecthealthcheck.
Shell#Assumesflexdirectisinstalled;seeinstallationguide.
flexdirecthealthThehealthcheckwillrunchecksonthenodesofyourclusterappropriatetotheirhardware(e.
g.
,GPUs)ortotheirconfiguration(e.
g.
RoCE).
Soyouroutputmaydifferfromthatshownbelow.
Checkstrytofindsettingsorproblemsthatwilllimitorpreventthehigh-bandwidth,low-latencycommunicationneededforthebestperformance.
Theresultsshouldbeselfexplanatory.
CheckswillbeperformedonthelocalhostandonallconfiguredGPUservers.
Shownbelowistheoutputfromthecheckonthelocalnode.
WHITEPAPER|6Output$flexdirecthealthHealthreportforserver:192.
168.
10.
41:56001SoftwareVersionschecks:[PASS]Checklibrarydependency[PASS]CheckOFEDVersion:4.
3[PASS]CheckCUDAversion>=7050:Currentversion:9020[PASS]CheckGPUdriverversion>=367.
0:Currentversion:396.
37SystemResourceschecks:[PASS]Checkflexdirectinstall[PASS]CheckexternalconnectivitytoBitfusionlicenseserver.
Ignoreforon-premiseinstallations.
[PASS]Checkshadowmemory192076MBandtotalgpumem64640MBPerformancechecks:[PASS]CheckMTUSize:hi-speedinterfacesMTU>=4K[PASS]Checkmemops[PASS]Checkulimit-n>=4096:4096[MARGINAL]Checkmultinodesupport:GPUdirectnotsupported[MARGINAL]Checknv_peer_mem:nv_peer_memmodulenotloaded[SKIPPED]CannotperformPCIediagnosiswithoutrootaccess.
RunthebinarywithsudotogetdiagnosisStabilitychecks:[PASS]CheckNetworkErrors/Drops:foundnoerrorsorpacketdrops[PASS]CheckIBphysicallyup[PASS]CheckIBSMup[PASS]CheckMADagentregistrationerrors[PASS]CheckGPUAPImismatch[PASS]CheckGPUXiderrors[PASS]Checktemperaturecom/compute/cuda/repos/ubuntu1604/x86_64/cuda-repo-ubuntu1604_9.
1.
85-1_amd64.
deb$sudodpkg-icuda-repo-ubuntu1604_9.
1.
85-1_amd64.
deb$sudoapt-keyadv--fetch-keyshttp://developer.
download.
nvidia.
com/compute/cuda/repos/ubuntu1604/x86_64/7fa2af80.
pub$sudoapt-getupdate$sudoapt-getinstall-ycuda-toolkit-9-1$rmcuda-repo-ubuntu1604_9.
1.
85-1_amd64.
debListCUDALibrariesCUDAlibsmightbemissingifyourinstallationisaminimalone.
Copy/pastethefilesfrom/usr/local/cuda/lib64:VerifytheCUDAinstallationTheeasiestwaytodothiswillbetoinstalltheCUDAsamples.
HereistheRPMforCUDAsamples.
ItistheRPMforRHEL7andCUDA9.
1.
Afterinstallingthesamples,makesuredeviceQuerycanberuneitherdirectly(ifitisaGPUmachine)orwithFlexDirect(ifitisaClientCPUmachine)ScheduleatestjobusingCUDAdeviceQuery:CUDAInstallationonCPUNodes/VMs/ContainersInordertorunGPUapplicationswithFlexDirectonCPUnodes,VMs,orincontainerswhichdonothavedirectaccesstophysicalGPUhardware,youstillneedtoinstallCUDA—butthereisnoneedtoinstallanyNvidiadrivercomponents.
TheexamplebelowillustrateshowtotoinstallCUDA9.
1inafeweasysteps.
TheCUDAversionthatyouinstallontheClientshouldmatchtheCUDAversionwhichisrunningontheserver.
VMware,Inc.
3401HillviewAvenuePaloAltoCA94304USATel877-486-9273Fax650-427-5001vmware.
comCopyright2019VMware,Inc.
Allrightsreserved.
ThisproductisprotectedbyU.
S.
andinternationalcopyrightandintellectualpropertylaws.
VMwareproductsarecoveredbyoneormorepatentslistedatvmware.
com/go/patents.
VMwareisaregisteredtrademarkortrademarkofVMware,Inc.
anditssubsidiariesintheUnitedStatesandotherjurisdictions.
Allothermarksandnamesmentionedhereinmaybetrademarksoftheirrespectivecompanies.
ItemNo:VMW-0518-1843_VMW_CPBUTechnicalWhitePapers_BitfusionDocs_05SystemRequirements_1.
4_YC8/19
百驰云成立于2017年,是一家新国人IDC商家,且正规持证IDC/ISP/CDN,商家主要提供数据中心基础服务、互联网业务解决方案,及专属服务器租用、云服务器、云虚拟主机、专属服务器托管、带宽租用等产品和服务。百驰云提供源自大陆、香港、韩国和美国等地骨干级机房优质资源,包括BGP国际多线网络,CN2点对点直连带宽以及国际顶尖品牌硬件。专注为个人开发者用户,中小型,大型企业用户提供一站式核心网络云端...
RackNerd 商家我们应该是比较熟悉的商家,速度一般,但是人家便宜且可选机房也是比较多的,较多集中在美国机房。包括前面的新年元旦促销的时候有提供年付10美元左右的方案,实际上RackNerd商家的营销策略也是如此,每逢节日都有活动,配置简单变化,价格基本差不多,所以我们网友看到没有必要囤货,有需要就选择。RackNerd 商家这次2022农历新年也是有几款年付套餐。低至RackNerd VPS...
青云互联怎么样?青云互联是一家成立于2020年6月份的主机服务商,致力于为用户提供高性价比稳定快速的主机托管服务,目前提供有美国免费主机、香港主机、香港服务器、美国云服务器,让您的网站高速、稳定运行。目前,美国洛杉矶cn2弹性云限时七折,美国cera机房三网CN2gia回程 13.3元/月起,可选Windows/可自定义配置。点击进入:青云互联官网青云互联优惠码:七折优惠码:dVRKp2tP (续...
WWW YC8 COM为你推荐
外网和内网什么是内网,和外网有什么区别自助建站自助建站到底好还是不好网店推广网站怎么免费推广淘宝店铺?中小企业信息化中小企业信息化途径有哪些xp系统停止服务xp系统停止服务怎么办?安装迅雷看看播放器迅雷看看播放器安装lockdowndios8.1怎么激活内置卡贴iphone6上市时间苹果6是什么时候出的 ?系统分析员考系统分析员有什么好处?怎么上传音乐怎样可以上传本地音乐到网上?
长春域名注册 广东vps flashfxp怎么用 狗爹 韩国加速器 godaddy续费优惠码 网络星期一 名片模板psd win8升级win10正式版 免费网络电视 云鼎网络 idc资讯 me空间社区 北京双线 四川电信商城 太原联通测速 ledlamp 114dns 国外免费云空间 国外代理服务器 更多