Title: Using Boto3 to create EMR cluster. · Issue #8 · aws-samples/aws-python-sample · GitHub
Open Graph Title: Using Boto3 to create EMR cluster. · Issue #8 · aws-samples/aws-python-sample
X Title: Using Boto3 to create EMR cluster. · Issue #8 · aws-samples/aws-python-sample
Description: Hi All, I am trying to automate the EMR cluster creation using Boto3. Which i am using to create the EMR cluster. I need a cluster created with Impala configured. Here is the parmas i passed to run_job_flow Name='AutmateEMR', ReleaseLabe...
Open Graph Description: Hi All, I am trying to automate the EMR cluster creation using Boto3. Which i am using to create the EMR cluster. I need a cluster created with Impala configured. Here is the parmas i passed to run...
X Description: Hi All, I am trying to automate the EMR cluster creation using Boto3. Which i am using to create the EMR cluster. I need a cluster created with Impala configured. Here is the parmas i passed to run...
Opengraph URL: https://github.com/aws-samples/aws-python-sample/issues/8
X: @github
Domain: patch-diff.githubusercontent.com
{"@context":"https://schema.org","@type":"DiscussionForumPosting","headline":"Using Boto3 to create EMR cluster.","articleBody":"Hi All,\n\nI am trying to automate the EMR cluster creation using Boto3. Which i am using to create the EMR cluster. I need a cluster created with Impala configured.\nHere is the parmas i passed to run_job_flow \nName='AutmateEMR',\n ReleaseLabel='emr-4.6.0',\n Instances={\n 'InstanceGroups': [{'InstanceCount':4,'InstanceRole':'CORE','InstanceType':'r3.8xlarge','Name':'slave'},{'InstanceCount':1,'InstanceRole':'MASTER','InstanceType':'r3.8xlarge','Name':'master'}],\n 'Ec2KeyName': 'MyKey',\n 'KeepJobFlowAliveWhenNoSteps': True,\n 'TerminationProtected': False,\n 'Ec2SubnetId': 'id',\n 'EmrManagedMasterSecurityGroup': 'value',\n 'EmrManagedSlaveSecurityGroup': 'value',\n 'ServiceAccessSecurityGroup': 'value',\n },\n BootstrapActions=[{'Name': 'Install Impala2','ScriptBootstrapAction': {'Path': 's3://coeus/bigtop/impala/impala-install'}}],\n Applications=[{'Name':'Hadoop','Name':'Spark','Name':'Ganglia','Name':'Hive','Name':'Presto-Sandbox'}],\n JobFlowRole='EMR_EC2_DefaultRole',\n ServiceRole='EMR_DefaultRole',\n VisibleToAllUsers=True|False,\n Tags=[{\"Key\":\"owner\",\"Value\":\"myname\"}],\n Configurations=[{\"Classification\":\"hadoop-env\",\"Properties\":{},\"Configurations\":[{\"Classification\":\"export\",\"Properties\":{\"JAVA_HOME\":\"/usr/lib/jvm/java-1.8.0\"},\"Configurations\":[]}]},{\"Classification\":\"spark-env\",\"Properties\":{},\"Configurations\":[{\"Classification\":\"export\",\"Properties\":{\"JAVA_HOME\":\"/usr/lib/jvm/java-1.8.0\"},\"Configurations\":[]}]}]\n\nThis code successfully creates the cluster but when i try to run the MapR jobs like distcp on the cluster it throws this error\n\"Error: Could not find or load main class org.apache.hadoop.mapreduce.v2.app.MRAppMaster\"\n\nI created the cluster using the console and passing same parameters the cluster gets created and I am able to run the MapR commands (Distcp) without having any issues. I am not sure why does EMR cluster created with Boto3 has the issues with hadoop config.\n\nHere is the cli export of the cluster i created using the console.\n\naws emr create-cluster --applications Name=Hadoop Name=Spark Name=Ganglia Name=Presto-Sandbox Name=Hive --bootstrap-actions '[{\"Path\":\"s3://coeus/bigtop/impala/impala-install\",\"Name\":\"Custom action\"}]' --tags 'owner=myname' --ec2-attributes '{\"KeyName\":\"mykey\",\"InstanceProfile\":\"EMR_EC2_DefaultRole\",\"ServiceAccessSecurityGroup\":\"\",\"SubnetId\":\"\",\"EmrManagedSlaveSecurityGroup\":\"\",\"EmrManagedMasterSecurityGroup\":\"\"}' --service-role EMR_DefaultRole --release-label emr-4.6.0 --log-uri ' ' --name 'automate' --instance-groups '[{\"InstanceCount\":1,\"InstanceGroupType\":\"MASTER\",\"InstanceType\":\"r3.8xlarge\",\"Name\":\"master\"},{\"InstanceCount\":4,\"InstanceGroupType\":\"CORE\",\"InstanceType\":\"r3.8xlarge\",\"Name\":\"slave\"}]' --configurations '[{\"Classification\":\"hadoop-env\",\"Properties\":{},\"Configurations\":[{\"Classification\":\"export\",\"Properties\":{\"JAVA_HOME\":\"/usr/lib/jvm/java-1.8.0\"},\"Configurations\":[]}]},{\"Classification\":\"spark-env\",\"Properties\":{},\"Configurations\":[{\"Classification\":\"export\",\"Properties\":{\"JAVA_HOME\":\"/usr/lib/jvm/java-1.8.0\"},\"Configurations\":[]}]}]' --region\n\nI am out of ideas why it should be happening. any help is highly appreciated.\n","author":{"url":"https://github.com/rahul22022","@type":"Person","name":"rahul22022"},"datePublished":"2016-09-19T23:52:04.000Z","interactionStatistic":{"@type":"InteractionCounter","interactionType":"https://schema.org/CommentAction","userInteractionCount":3},"url":"https://github.com/8/aws-python-sample/issues/8"}
| route-pattern | /_view_fragments/issues/show/:user_id/:repository/:id/issue_layout(.:format) |
| route-controller | voltron_issues_fragments |
| route-action | issue_layout |
| fetch-nonce | v2:9d52efe3-dd23-672b-b2eb-7c13e5a894a6 |
| current-catalog-service-hash | 81bb79d38c15960b92d99bca9288a9108c7a47b18f2423d0f6438c5b7bcd2114 |
| request-id | DE8E:128C7A:13F31C1:19F7D77:698167F2 |
| html-safe-nonce | 2c5b6c6b4e0512da760e7f753e2484ea886272bb1b0c28fa848248cb56bfb395 |
| visitor-payload | eyJyZWZlcnJlciI6IiIsInJlcXVlc3RfaWQiOiJERThFOjEyOEM3QToxM0YzMUMxOjE5RjdENzc6Njk4MTY3RjIiLCJ2aXNpdG9yX2lkIjoiNjU0NDg5NjM3NDAyNTA1NDE5NCIsInJlZ2lvbl9lZGdlIjoiaWFkIiwicmVnaW9uX3JlbmRlciI6ImlhZCJ9 |
| visitor-hmac | 73933885d24fb6045174706c4e9ad393cb1cb9606fc5a6f986e27ac457be9112 |
| hovercard-subject-tag | issue:177928061 |
| github-keyboard-shortcuts | repository,issues,copilot |
| google-site-verification | Apib7-x98H0j5cPqHWwSMm6dNU4GmODRoqxLiDzdx9I |
| octolytics-url | https://collector.github.com/github/collect |
| analytics-location | / |
| fb:app_id | 1401488693436528 |
| apple-itunes-app | app-id=1477376905, app-argument=https://github.com/_view_fragments/issues/show/aws-samples/aws-python-sample/8/issue_layout |
| twitter:image | https://opengraph.githubassets.com/212662c52ab56ef8be24124b1406cb3280812184dc6b624c88208ecd316dab10/aws-samples/aws-python-sample/issues/8 |
| twitter:card | summary_large_image |
| og:image | https://opengraph.githubassets.com/212662c52ab56ef8be24124b1406cb3280812184dc6b624c88208ecd316dab10/aws-samples/aws-python-sample/issues/8 |
| og:image:alt | Hi All, I am trying to automate the EMR cluster creation using Boto3. Which i am using to create the EMR cluster. I need a cluster created with Impala configured. Here is the parmas i passed to run... |
| og:image:width | 1200 |
| og:image:height | 600 |
| og:site_name | GitHub |
| og:type | object |
| og:author:username | rahul22022 |
| hostname | github.com |
| expected-hostname | github.com |
| None | e137814e266030874fd2c86863529d0622b13889eeda04148c57654b6ea84ad6 |
| turbo-cache-control | no-preview |
| go-import | github.com/aws-samples/aws-python-sample git https://github.com/aws-samples/aws-python-sample.git |
| octolytics-dimension-user_id | 8931462 |
| octolytics-dimension-user_login | aws-samples |
| octolytics-dimension-repository_id | 12929872 |
| octolytics-dimension-repository_nwo | aws-samples/aws-python-sample |
| octolytics-dimension-repository_public | true |
| octolytics-dimension-repository_is_fork | false |
| octolytics-dimension-repository_network_root_id | 12929872 |
| octolytics-dimension-repository_network_root_nwo | aws-samples/aws-python-sample |
| turbo-body-classes | logged-out env-production page-responsive |
| disable-turbo | false |
| browser-stats-url | https://api.github.com/_private/browser/stats |
| browser-errors-url | https://api.github.com/_private/browser/errors |
| release | dd58d68a7813bbec9c91422c8c35f4af33832d70 |
| ui-target | full |
| theme-color | #1e2327 |
| color-scheme | light dark |
Links:
Viewport: width=device-width