Accelerate Cloud Analytics on Azure with Paxata...Accelerate Cloud Analytics on Azure with Paxata...

Preview:

Citation preview

AccelerateCloudAnalytics

onAzurewithPaxata

INSTALLINGPAXATAONAZUREMARKETPLACE

Welcome!Beitself-serviceanalyticsorexploringandprofilingyourdatalake,youhavetakentherightfirststeptowardsdemocratizingdataandinformationinyourorganizationbychoosingtoinstallPaxatainyourAzureenvironment.PaxatacustomersturnrawdataintoinformationusingPaxata’sSelf-servicedatapreparationapplication–Informationthatisclean,complete,contextualandconsumable.Nowwithone-clickinstallonAzure,and30-dayfreetriallicense,evaluatingPaxatajustgotaloteasier.ThisdocumentwalksyouthroughthenecessarystepstoprovisionPaxataonAzurewithinyourvirtualnetwork.

PRE-REQUISITEToinstallPaxataonAzure,youneedaMicrosoftaccountandAzuresubscription.Itissimpletocreateyouraccountandcreateapay-as-you-gosubscription.Ifyouworkforalargercompany,youmaywanttoaskyourITadministratortosetitupforyouandprovisionauseraccountforyouwithaccesstothesubscription.

MINIMUMREQUIREMENTS

1. ClusterTypeandVersion:ToinstallPaxata,youneedaHDInsightSparkCluster(VersionSpark2.1/HDInsight3.6)

2. HardwareRequirements:a. WorkerCPUCores:

i. ToinstallPaxata,youwillneedtohaveatleasta4CPUWorkerCores.WerecommendgettingaDorD-V2Serieshardware.

ii. Ifyouplantorunlargerinteractiveworkloads(say10MillionRows/1GB),youmayincreasethenumberofworkersupto32workercores.Thiswillgiveyougoodinteractiveresponsetime.

iii. Ifyouaregetting32cores,itisbettertogetfour8-coreVMs.Thisallowsforhigherredundancyincasethereareworkercorefailures.Atthesametimewith4cores,youwillnotpaymuchofSparkshufflecost.

b. Memoryi. Eachworkernodemusthaveatleast14GBof

memory.IfyouattempttoinstallPaxataonaclusterwithlessthan14GBworkermemory,youwillgetanerrormessage.

INSTALLINGPAXATA-THREEINSTALLPATHSTherearethreepathsonecantakeonAzureportaltoinstallPaxata.FirstvisitAzureportalatportal.azure.comandloginwithyourMicrosoftAccount.

1. FirstinstallaHDInsightsparkclusterandtheninstallPaxata.OryoucanstartfromanexistingHDInsightCluster.

2. SearchforPaxataontheportalandyouwillbeguidedtoawizardwhereyoucaninstallbothaHDInsightSparkClusterandPaxata

3. StartInstallingaHDInsightSparkClusterandyouwillbegiventheoptiontoinstallPaxataalongwithit

INSTALLPAXATAONTOPOFANEW/EXISTINGCLUSTERFirstensurethattheHDInsightclusteryouhaveprovisionedissuitableforPaxata.TheimagebelowshowstherightclustertypeyoumusthavetoinstallPaxata.Ifyouhaveanyothertypeofcluster,Paxatawillbeunavailabletobeinstalledonthecluster.

1. ClusternamebecomespartofthePaxataURL.Forexample,ifyou

nametheclusterasmycluster,theURLtoaccessyourPaxatainstallationwillbehttps://mycluster-pax.apps.azurehdinsight.net/

2. Intheclustertype,remembertoselectaLinuxbasedSparkclusterwithSparkversion2.1

3. Provideanadministratorpasswordthatyouremember.YouwillneedthistoaccesstheedgenodewherePaxatawillbeinstalled.

Thebelowimageshowsaclusterwith8workercores(2xD12v2).Youcouldstartwithaslittleas4cores(1xD14v2).AllHDInsightclusterscomestandardwithtwoheadnodes.

Onceyoustepthroughthewizardandsubmitit,yourclusterwillbereadyin15-20minutes.Onceyouhaveprovisionedacluster,youcaninstallPaxata.ClickontheApplicationslinkonyourHDInsightCluster.Inthescreenshotbelow,theapplicationslinkappearsintwoplaces.Clickingoneitheronewilltakeyoutothesameplace.Thisiscalledthe“HDInsightApplicationBlade”.

HDInsightapplicationbladeistheplacewhereyoucanseealltheapplicationsinstalledontopoftheHDInsightcluster.Inthebelowscreenshot,youcanseethattherearenoapplicationsinstalled.

Clickon+Addbuttonontopandselect“SelfServiceDataPreparationbyPaxata”.ThiswilllaunchthePaxatablade(screenshotbelow).

1. OnceyouareinPaxatablade,clickonthe“GETINSTALLKEY”link.ThiswilltakeyoutoaPaxatawebpagewitharegistrationform.

2. Whileregister,besuretoprovideavalidworkemail(e.g.

joe.foo@microsoft.com).a. Ifyouremailisassociatedwithcompany/organizationthat

isaPaxatacustomer,prospectorapartneryouwillgetaninstallkeyimmediately(inlessthan5minutes).

b. Elsesomeonewillhavetomanuallyreviewandapproveyourrequest.Onceapprovedyouwillgetaninstallkey.Thiscouldtakeabout24hours.

3. Checkyouremailinbox.YouwillreceiveaninstallkeyalongwithasetofcredentialsfromPaxata.Besuretocheckthespamfolderafter24hours.

a. Installkeyisalongstringthattypicallylookslikethis:3a6809fe-fcdf-4d8e-8ad8-bc7c48445a81

4. Onceyouhavetheinstallkey,proceedbacktotheAzureportal>Paxatablade.

a. Enterthekeyinthe“LicenseKey”field.b. Reviewthetermsofusebeforeacceptingthem.Click

Purchaseafteryouhavereviewedtheterms.c. ClickOKinthePaxatablade.

d. ClickNextontheapplicationblade.

5. Youshouldseeanotification“…Installingappsto<cluster-name>”

6. OncetheinstallationiscompleteyoushouldseePaxataamongtheinstalledapplicationsonyourapplicationblade.

7. EitherthePortalLinknexttotheapplicationorthewebpagelink

ontheright-handsidewilltakeyouSelfServiceDataPreparationapplicationfromPaxata.

a. Ifyouareatechnicaluser,youmaywanttonotedowntheSSHURL.ThisURLisalwayslistedinthispage.

b. Also,thismaybeagoodtimetocheckoutsomeoftheUsefulLinks,especiallytheTipoftheDaylink.

8. ClickontheabovelinktogotoPaxata

9. Usethecredentialssenttoyouinthefirstemail(withinstallkey)tologintoPaxata.

SUCCESSFULINSTALLATIONOncetheinstallationiscompleteyouwillreceiveawelcomeemailstatingthatyour“30-dayfreetrialstartstoday”.ThisemailcomeswithYouTubeTipofthedayvideo.Youwillreceiveafewmoreemailsarticulatinghowtouse,administertheproducttosuccessfullycompleteyourfunctionalevaluationofPaxata.Wearealsoconstantlyinworktoaddmorevideosandbringingyouaccesstoouradministrationguides.Staytuned.

ERRORCONDITIONS–DURINGINSTALLATIONWhileinstallingPaxatayoucanrunintofourpossibleerrorconditions.Ifanyoftheerrorconditionoccurs,insteadofseeingPaxataloginscreen(imageabove),youwillseeastaticwebpagewithanerrormessage(imagebelow).

1. InstallationKeyalreadyuseda. Allinstallkeysarevalidonlyforone-time.Pleaseregister

againtogetanotherinstallationkey.Paxatacustomers,

prospectsandpartnerscangetasmanyinstallkeysastheywant.

2. InstallationKeyExpireda. Allinstallkeysarevalidforonly30daysfromthedayyou

receivethem.Pleaseregisteragaintogetanotherinstallationkey.

3. InvalidInstallationKeya. Ifyoutriedtoenteranyarbitrarytext(insteadofaninstall

keythatwasgeneratedandsenttoyou)thenyouwillgetaninvalidinstallkeyerror.

b. SimplyregisterinourwebsiteandusetheinstallationkeyyoureceivefromPaxata.

4. InternalErrora. MostlikelyreasonforthisisPaxatainstallationfilesdidnot

downloadproperly.b. Ifyouseethiserror,youcanreusetheinstallationkey.

Installationkeyisnotmarkedasusediftheinstallationfailedduetoaninternalerror.

INSTALLINGPAXATAALONGWITHTHEHDINSIGHTCLUSTERInsteadofinstallingPaxataontoponanexistingcluster,youcaninstallPaxataalongwiththeHDInsightclusteratthesametime.Therearetwowaystodoit.

STARTWITHPAXATA

1. Startfromthemarketplacehome(portal.azure.com).2. Clickonthe+Newiconontheleft.3. Searchfor“Paxata”4. ClickonPaxata

ThiswilltakeyoutoawizardwhereyoucanbuildaHDInsightClusterandinstallPaxata.Thechoicesyouwillseearethesameaslistedabove,exceptyouwillhaveonecombinedwizardtodeploytheclusterandinstallPaxata.

STARTWITHHDINSIGHTSPARKCLUSTER

1. Clickonthe+iconontheleft.2. SelectDataandAnalytics3. SelectHDInsight4. ThiswilltakeyoutotheHDInsightinstallationWizard.Expandthe

wizardbyclickingon“Custom”

5. Selectaclustersize,machinetypeandfollowPaxatainstallation

instructionsaslistedabove(underexistingclusterdeploymentsection).

Recommended