About the Team
Yahoo is looking for engineers to join the Yahoo Hadoop team. The team has openings in Champaign IL. In this role you would be working on developing the next generation Big Data stack utilizing Hadoop for Batch, Storm for real time, and Spark for Iterative. Hadoop is the de-facto operating system for the worldwide cloud computing industry. Yahoo is the birthplace of Hadoop and has the largest Hadoop install in the world. The server count has grown from a few hundred nodes, to over 30,000 in the past few years. We have over 400,000 cores that process nearly 300PB of data spread over 25 clusters running a mix of Hadoop, Storm, and Spark.
About the Role
We are looking for experienced and motivated software engineers to help build a highly scalable and robust next generation Big Data stack to power Yahoo's data processing needs. The qualified engineer would work on one or more of the following open source projects: Hadoop Core (Common, Mapreduce, YARN, HDFS), Storm, or Spark.
If you have strong distributed systems background, love to solve complex and challenging problems, can work independently, and want to participate in some of the most exciting open source projects, we want to hear from you!. If you want to learn about Hadoop, Storm, or Spark and get a deep understanding of cloud computing, this is the job for you.
Responsibilities:
- Help Yahoo design the next generation cloud
- Understand all aspects of Yahoo's Big Data Stack (Hadoop, Storm, Spark) and learn select components in detail
- Be a leader in open source (Apache Software Foundation) projects
- Design massively distributed technology and develop leading edge cloud computing software
- Work closely with service engineering, operations, and Hadoop users to engineer cutting-edge solutions that allow Yahoo to answer today's most challenging big data questions.
Qualifications
- BS/MS in Computer Science (or equivalent)
- Strong in Java or C++
- Deep understanding of Algorithms, Data Structures, and Performance Optimization Techniques
- Experience with one or more of the following: Hadoop Core, Pig, Hive, Hcat, Oozie, Hbase, Spark, or Storm
- Experience with large distributed data and systems
- Knowledge of machine learning techniques
- Knowledge of database internals and query optimization
Please send resumes to resume-hadoop@yahoo-inc.com