Amazon Elastic MapReduce (EMR) is a web service that provides a managed framework to run data processing frameworks such as Apache Hadoop, Apache Spark, and Presto in an easy, cost-effective, and secure manner. It is used for data analysis, web indexing, data warehousing, financial analysis, scientific simulation, etc. How to Set Up Amazon EMR? Follow these steps to set up Amazon EMR − Step 1 − Sign in to AWS account and select Amazon EMR on management console. Step 2 − Create Amazon S3 bucket for cluster logs & output data. (Procedure is explained in detail in Amazon S3 section) Step 3 − Launch Amazon EMR cluster. Following are the steps to create cluster and launch it to EMR. Use this link to open Amazon EMR console − https://console.aws.amazon. com/ elasticmapreduce/home Select create cluster and provide the required details on Cluster Configuration page. Leave the Tags section options as default and proceed. On the Software configuration section, le...
Comments
Post a Comment