At the CEB (now Gartner) Data Innovation Center, we hire the brightest and most talented people in the world. Are you looking to work on the most innovative projects and build technologies that change the world? At the CEB (now Gartner) Data Innovation Center, you have the chance to build the future.
We are looking for smart Data Scientists who can generate insights from our data. Join us this summer for a chance to create some of the most innovative solutions at our Data Innovation Center.
The Data Science Intern will focus on mid-size to large projects for the development of NLP and machine learning solutions, employing a working knowledge of SDLC, SCRUM, and Agile methodologies. The intern will design, specify, and implement state-of-the-art natural language processing and machine learning solutions for our next-generation products. This position emphasizes the ability to conceptualize, design, and develop reusable NLP and ML models, as well as very strong technical knowledge for implementing big data technologies. Additionally, the Data Science Intern will play a key role in requirements gathering, project documentation, application development tasks, and the translation of requirements for project teams both foreign and domestic.
Requirements
· Demonstrable experience in developing natural language processing models and corpora
· Experience with NLP tools such as NLTK, OpenNLP, Stanford CoreNLP and similar open source solutions
· Experience with NLP tagging methods and techniques such as CCG and Penn Treebank
· Experience with multilingual (international) language processing and tagging
· Experience with NLP tasks such as tokenization, parsing, lemmatization, POS tagging, and Named Entity Recognition (NER), for example with Stanford NER (SNER)
· Experience in topic mining using Latent Dirichlet Allocation (LDA), keywords, n-grams, and tf-idf vectors
· Experience in document classification using different language models and similarity metrics such as Word2Vec, KL-Divergence, and Cosine Similarity
· Experience with developing NLP applications such as sentiment analysis, topic modeling, and text summarization
· Experience with developing NLP tools using machine learning, statistical analysis, bag of words, and part-of-speech tagging
· Ability to apply combinations of classifiers such as Naïve Bayes, Decision Tree, k-NN, Neural Networks, and SVM
· Experience developing and applying machine learning using tools such as Python scikit-learn, R, or similar
· Experience in mining/analyzing vast data stores and uncovering insights.
· Experience in building Recommendation Systems
· Experience in writing clean, documented, modular, reusable code in Python
· Experience in data wrangling and munging using Python libraries like NumPy, pandas, BeautifulSoup, etc.
· Experience in web development, JavaScript frameworks, and programming languages like Java, Scala, or C++ is a plus
· Experience with Big Data tools like Hadoop and Spark is a plus
· Experience with deep learning techniques like LSTM and RNN is a plus
Role Qualifications
· Knowledge of Python and Java
· Ability to write high-performance, reliable, and maintainable code
· Proficiency in SQL database query development and data analysis
· Excellent oral and written communication skills
· Critical thinking with excellent judgment and initiative
· Self-motivator with great attention to detail and follow-through
· Proficiency with Excel, Word, PowerPoint, and Visio
· Proven analytical skills
· Ability to work independently and in a team setting
· Ability to excel in a fast-paced environment
· Very good listening and interpretation skills
Preferred Qualifications
· Knowledge of Hadoop
· Experience managing and deploying HBase
· Good knowledge of back-end programming, specifically Java, JavaScript, Node.js, and OOAD
· Knowledge of SQL Server Management Studio, GitHub, and MongoDB
Education and Experience
· Bachelor’s, Master’s, or PhD degree in Information Systems, Information Technology, Computer Science, Engineering, Statistics, Mathematics, or a related discipline