Cloudera Hadoop Administrator
Raleigh (Wake County) Administration
Job description
Experienced Cloudera Hadoop Administrator who will be part of a global team responsible for implementation and ongoing administration of CS Big Data infrastructure.
Skill requirements:
· Intimate knowledge of non-root installation with fully integrated AD/Kerberos authentication
· Working with data delivery teams to setup new Hadoop clusters. This job includes setting up Kerberos principals and testing
· Expert level knowledge of Cloudera Hadoop components such as HDFS, Sentry, HBase, Accumulo, Impala, Hue, Spark, Hive, YARN, ZooKeeper and Postgres
· Cluster maintenance as well as creation and removal of nodes using tools like Cloudera Manager Enterprise
· Performance tuning of Hadoop clusters and Hadoop MapReduce routines
· Screen Hadoop cluster job performances and capacity planning
· Monitor Hadoop cluster connectivity and security
· Manage, Troubleshoot and review Hadoop log files
· Eyes-on-glass cluster monitoring and responding to internal tickets and email alerts. File system management and monitoring
· Setup Splunk monitoring
· HDFS and other Cloudera Hadoop components support and maintenance
· Diligently teaming with the infrastructure, network, database, application and business intelligence teams to guarantee high data quality and availability
· Collaborating with application teams to install Hadoop updates, patches, version upgrades when required
· Aligning with the engineering team to propose and deploy new hardware and software environments required for Hadoop and to expand existing environments
· Point of Contact for Vendor escalation
Desired profile
· 2-3 years of recent Big Data experience
· Hadoop Administration
· Cloudera Hadoop Administration, preferable version 5.5 or higher, or Hadoop development experience with administration experience
· Expert level knowledge of Cloudera Hadoop components such as HDFS, Sentry, HBase, Accumulo, Impala, Hue, Spark, Hive, YARN, ZooKeeper and Postgres
· Experience with monitoring clusters, Splunk, Python, Linux Shell Scripting, RHEL 6, UNIX networking, and troubleshooting.
· Cloudera Certified Administrator for Apache Hadoop (CCAH) highly desired