accelerate cloud analytics on azure with paxata...accelerate cloud analytics on azure with paxata...
TRANSCRIPT
AccelerateCloudAnalytics
onAzurewithPaxata
INSTALLINGPAXATAONAZUREMARKETPLACE
Welcome!Beitself-serviceanalyticsorexploringandprofilingyourdatalake,youhavetakentherightfirststeptowardsdemocratizingdataandinformationinyourorganizationbychoosingtoinstallPaxatainyourAzureenvironment.PaxatacustomersturnrawdataintoinformationusingPaxata’sSelf-servicedatapreparationapplication–Informationthatisclean,complete,contextualandconsumable.Nowwithone-clickinstallonAzure,and30-dayfreetriallicense,evaluatingPaxatajustgotaloteasier.ThisdocumentwalksyouthroughthenecessarystepstoprovisionPaxataonAzurewithinyourvirtualnetwork.
PRE-REQUISITEToinstallPaxataonAzure,youneedaMicrosoftaccountandAzuresubscription.Itissimpletocreateyouraccountandcreateapay-as-you-gosubscription.Ifyouworkforalargercompany,youmaywanttoaskyourITadministratortosetitupforyouandprovisionauseraccountforyouwithaccesstothesubscription.
MINIMUMREQUIREMENTS
1. ClusterTypeandVersion:ToinstallPaxata,youneedaHDInsightSparkCluster(VersionSpark2.1/HDInsight3.6)
2. HardwareRequirements:a. WorkerCPUCores:
i. ToinstallPaxata,youwillneedtohaveatleasta4CPUWorkerCores.WerecommendgettingaDorD-V2Serieshardware.
ii. Ifyouplantorunlargerinteractiveworkloads(say10MillionRows/1GB),youmayincreasethenumberofworkersupto32workercores.Thiswillgiveyougoodinteractiveresponsetime.
iii. Ifyouaregetting32cores,itisbettertogetfour8-coreVMs.Thisallowsforhigherredundancyincasethereareworkercorefailures.Atthesametimewith4cores,youwillnotpaymuchofSparkshufflecost.
b. Memoryi. Eachworkernodemusthaveatleast14GBof
memory.IfyouattempttoinstallPaxataonaclusterwithlessthan14GBworkermemory,youwillgetanerrormessage.
INSTALLINGPAXATA-THREEINSTALLPATHSTherearethreepathsonecantakeonAzureportaltoinstallPaxata.FirstvisitAzureportalatportal.azure.comandloginwithyourMicrosoftAccount.
1. FirstinstallaHDInsightsparkclusterandtheninstallPaxata.OryoucanstartfromanexistingHDInsightCluster.
2. SearchforPaxataontheportalandyouwillbeguidedtoawizardwhereyoucaninstallbothaHDInsightSparkClusterandPaxata
3. StartInstallingaHDInsightSparkClusterandyouwillbegiventheoptiontoinstallPaxataalongwithit
INSTALLPAXATAONTOPOFANEW/EXISTINGCLUSTERFirstensurethattheHDInsightclusteryouhaveprovisionedissuitableforPaxata.TheimagebelowshowstherightclustertypeyoumusthavetoinstallPaxata.Ifyouhaveanyothertypeofcluster,Paxatawillbeunavailabletobeinstalledonthecluster.
1. ClusternamebecomespartofthePaxataURL.Forexample,ifyou
nametheclusterasmycluster,theURLtoaccessyourPaxatainstallationwillbehttps://mycluster-pax.apps.azurehdinsight.net/
2. Intheclustertype,remembertoselectaLinuxbasedSparkclusterwithSparkversion2.1
3. Provideanadministratorpasswordthatyouremember.YouwillneedthistoaccesstheedgenodewherePaxatawillbeinstalled.
Thebelowimageshowsaclusterwith8workercores(2xD12v2).Youcouldstartwithaslittleas4cores(1xD14v2).AllHDInsightclusterscomestandardwithtwoheadnodes.
Onceyoustepthroughthewizardandsubmitit,yourclusterwillbereadyin15-20minutes.Onceyouhaveprovisionedacluster,youcaninstallPaxata.ClickontheApplicationslinkonyourHDInsightCluster.Inthescreenshotbelow,theapplicationslinkappearsintwoplaces.Clickingoneitheronewilltakeyoutothesameplace.Thisiscalledthe“HDInsightApplicationBlade”.
HDInsightapplicationbladeistheplacewhereyoucanseealltheapplicationsinstalledontopoftheHDInsightcluster.Inthebelowscreenshot,youcanseethattherearenoapplicationsinstalled.
Clickon+Addbuttonontopandselect“SelfServiceDataPreparationbyPaxata”.ThiswilllaunchthePaxatablade(screenshotbelow).
1. OnceyouareinPaxatablade,clickonthe“GETINSTALLKEY”link.ThiswilltakeyoutoaPaxatawebpagewitharegistrationform.
2. Whileregister,besuretoprovideavalidworkemail(e.g.
[email protected]).a. Ifyouremailisassociatedwithcompany/organizationthat
isaPaxatacustomer,prospectorapartneryouwillgetaninstallkeyimmediately(inlessthan5minutes).
b. Elsesomeonewillhavetomanuallyreviewandapproveyourrequest.Onceapprovedyouwillgetaninstallkey.Thiscouldtakeabout24hours.
3. Checkyouremailinbox.YouwillreceiveaninstallkeyalongwithasetofcredentialsfromPaxata.Besuretocheckthespamfolderafter24hours.
a. Installkeyisalongstringthattypicallylookslikethis:3a6809fe-fcdf-4d8e-8ad8-bc7c48445a81
4. Onceyouhavetheinstallkey,proceedbacktotheAzureportal>Paxatablade.
a. Enterthekeyinthe“LicenseKey”field.b. Reviewthetermsofusebeforeacceptingthem.Click
Purchaseafteryouhavereviewedtheterms.c. ClickOKinthePaxatablade.
d. ClickNextontheapplicationblade.
5. Youshouldseeanotification“…Installingappsto<cluster-name>”
6. OncetheinstallationiscompleteyoushouldseePaxataamongtheinstalledapplicationsonyourapplicationblade.
7. EitherthePortalLinknexttotheapplicationorthewebpagelink
ontheright-handsidewilltakeyouSelfServiceDataPreparationapplicationfromPaxata.
a. Ifyouareatechnicaluser,youmaywanttonotedowntheSSHURL.ThisURLisalwayslistedinthispage.
b. Also,thismaybeagoodtimetocheckoutsomeoftheUsefulLinks,especiallytheTipoftheDaylink.
8. ClickontheabovelinktogotoPaxata
9. Usethecredentialssenttoyouinthefirstemail(withinstallkey)tologintoPaxata.
SUCCESSFULINSTALLATIONOncetheinstallationiscompleteyouwillreceiveawelcomeemailstatingthatyour“30-dayfreetrialstartstoday”.ThisemailcomeswithYouTubeTipofthedayvideo.Youwillreceiveafewmoreemailsarticulatinghowtouse,administertheproducttosuccessfullycompleteyourfunctionalevaluationofPaxata.Wearealsoconstantlyinworktoaddmorevideosandbringingyouaccesstoouradministrationguides.Staytuned.
ERRORCONDITIONS–DURINGINSTALLATIONWhileinstallingPaxatayoucanrunintofourpossibleerrorconditions.Ifanyoftheerrorconditionoccurs,insteadofseeingPaxataloginscreen(imageabove),youwillseeastaticwebpagewithanerrormessage(imagebelow).
1. InstallationKeyalreadyuseda. Allinstallkeysarevalidonlyforone-time.Pleaseregister
againtogetanotherinstallationkey.Paxatacustomers,
prospectsandpartnerscangetasmanyinstallkeysastheywant.
2. InstallationKeyExpireda. Allinstallkeysarevalidforonly30daysfromthedayyou
receivethem.Pleaseregisteragaintogetanotherinstallationkey.
3. InvalidInstallationKeya. Ifyoutriedtoenteranyarbitrarytext(insteadofaninstall
keythatwasgeneratedandsenttoyou)thenyouwillgetaninvalidinstallkeyerror.
b. SimplyregisterinourwebsiteandusetheinstallationkeyyoureceivefromPaxata.
4. InternalErrora. MostlikelyreasonforthisisPaxatainstallationfilesdidnot
downloadproperly.b. Ifyouseethiserror,youcanreusetheinstallationkey.
Installationkeyisnotmarkedasusediftheinstallationfailedduetoaninternalerror.
INSTALLINGPAXATAALONGWITHTHEHDINSIGHTCLUSTERInsteadofinstallingPaxataontoponanexistingcluster,youcaninstallPaxataalongwiththeHDInsightclusteratthesametime.Therearetwowaystodoit.
STARTWITHPAXATA
1. Startfromthemarketplacehome(portal.azure.com).2. Clickonthe+Newiconontheleft.3. Searchfor“Paxata”4. ClickonPaxata
ThiswilltakeyoutoawizardwhereyoucanbuildaHDInsightClusterandinstallPaxata.Thechoicesyouwillseearethesameaslistedabove,exceptyouwillhaveonecombinedwizardtodeploytheclusterandinstallPaxata.
STARTWITHHDINSIGHTSPARKCLUSTER
1. Clickonthe+iconontheleft.2. SelectDataandAnalytics3. SelectHDInsight4. ThiswilltakeyoutotheHDInsightinstallationWizard.Expandthe
wizardbyclickingon“Custom”
5. Selectaclustersize,machinetypeandfollowPaxatainstallation
instructionsaslistedabove(underexistingclusterdeploymentsection).