Overview
Trendiction is a growing web data collection company that crawls, analyzes, indexes, and stores the content of news sites, blogs, message boards, and other social media in order to make this data available to our clients in a structured format.
Our clients are market research companies, analytics companies, and generally anyone who needs access to high-quality social media data.
For more information, please visit our website at http://www.trendiction.de/w/product
We are expanding our development team by three or more developers. This job offer is for a permanent position at our offices in the Technoport in Esch-sur-Alzette, Luxembourg.
Experience
Requirements and Experience:
* Java
* Good knowledge of algorithms and data structures (e.g. http://www.amazon.de/Algorithmen-Datenstrukturen-T...)
* General web knowledge (HTTP, XPath, HTML, content extraction/parsing, duplicate content detection, spam detection, encoding, clustering, sentiment detection…).
* Ability to work in (and appreciation for) a small company/startup environment.
* University background (Bachelor's or Master's degree in computer science or a related field).
You should be creative and self-motivated, able to write quick prototypes, try things out on your own, and simply get things done.
It is a plus if you have experience setting up automated build and test tools, are a Java expert, or have experience with search technologies (e.g. Lucene).
Responsibilities
* Work on and help redesign new crawling technology.
* Improve automated site detection and automated content extraction of articles and comments.
What we provide
* Work on interesting problems/features/products involving large datasets.
* Write production code for large-scale applications built on tools like Apache Hadoop, Cassandra, HBase, and ZooKeeper, running on over 50 servers and processing terabytes of data each day.
* Work in a team of ETH Zurich, Harvard and TU Munich graduates.
* Startup atmosphere and environment: actively take part in the development of the company.
How to Apply