< Careers

Junior Data Engineer (Web Scraping)

Madrid
python, scrapy, http, sql
Apply Now
info This job requires EU working permit and residency in Madrid.

Your Job

As part of the Data Collection team, you will be proud of our data being always fresh, available and in top quality. You will contribute to the team in achieve these by:
  • Coordinating with our spider developers, QA team and Data Operations team
  • Answering requests from other teams in the company
  • Maintaining monitoring systems to keep an eye on our spiders
  • Developing feeds based on APIs and provided data

Your Skills

In order to perform the tasks needed for the role, you'll need to bring your skills and knowledge involving:
  • Python, as it's the base of our tech stack at many levels
  • How the web works: Requests, responses, user agents, proxies, HTTP protocol, robots.txt
  • Linux shell command line: Don't need to be a sysadmin but we expect you to be able to find your way around in your local machine and in any server box
  • SQL, as we work with data
  • English: A big part of the company doesn't speak spanish, so your job involves some written and spoken communication with people in other countries/timezones and with different cultures

Bonus Points

You will learn on the job other tech pieces we use. Of course, it will be easier if you are already familiar with them:
  • Scrapy, as every spider is based on this
  • Anti-bot measures for crawling and methods to avoid them
  • Jira as a regular user

The Team

Data Collection team is responsible for collecting massive amounts of data from fashion related websites and placing it into our analytics data repository. Our team takes care of developing and maintaining spiders, monitoring and checking they are running properly, checking the infrastructure is adequate and in good health and, in summary, handling good clean data to go through the rest of the pipeline. Our database already contains more than 600 millions of products, which are refreshed daily, and we see around 1 million of new products every week.

The Company

Our motto is "We love data". And we love technology that deals with data because it enables us to do incredible things... things that are valuable for our customers and that sustain a business.

StyleSage is (no longer) a startup founded 6 years ago with offices in New York and Madrid. Madrid is the home for our core technical team of around 20 people. It's an open, diverse and inclusive team of very skilled and talented individuals that are happy to collaborate, share knowledge and enjoy building great software together. We are looking forward to welcoming additional members for this team.

What We Offer

  • First and foremost: permanent contract and competitive salary.
  • Teams are made of people, not resources.
  • Open, diverse and inclusive environment.
  • A challenging and fun project to work and grow with, with the latest technologies, best practices and light speed evolution, all in a friendly, relaxed and positive environment.
  • Fixed yearly training budget to spend on english classes, courses, books, or conferences.
  • Your brand new laptop with OS of your choice (we recommend MacOSX or any flavor of Linux).
  • A team of colleagues that will share a lot of knowledge with you (we have weekly in-depth internal talks).
  • Fully remote position until it's safe to go back to our office in a co-working space quite close to Avenida de América metro station. We care about our team's health.
  • Working hours are flexible and, when we can go back to the office, you can still work remotely when needed, although you will miss the fruit and coffee/tea we stock in there.

The Joel Test

Our score: 10.5/12

The Joel Test is a quick and simple way to measure the quality of a software development team.
Do you want to join us?
1. Do you use source control?
Absolutely.
+1.0
2. Can you make a build in one step?
Yes, build & deploy.
+1.0
3. Do you make daily builds?
More than one.
+1.0
4. Do you have a bug database?
Yes, JIRA.
+1.0
5. Do you fix bugs before writing new code?
Most of the times.
+0.5
6. Do you have an up-to-date schedule?
Yes.
+1.0
7. Do you have a spec?
Yes.
+1.0
8. Do programmers have quiet working conditions?
Yes.
+1.0
9. Do you use the best tools money can buy?
Yes.
+1.0
10. Do you have testers?
Not yet.
+0.0
11. Do new candidates write code during their interview?
Yes.
+1.0
12. Do you do hallway usability testing?
Yes.
+1.0
user typing

Join our awesome team.

Fill in some details and we will contact you.
Junior Data Engineer (Web Scraping)
Madrid
Apply Now
Cookies help us better deliver our services.
By continuing, you agree to the use of our cookies.
Click here to read more about this.