
Data Analytics Web Scraping Engineer (part-time)

Overview

IDC is seeking a part-time Data Analytics Engineer for our Webscraping and Data Harvesting Team based in Ostrava, Czech Republic. This role supports our established team, which focuses on web crawling and gathering data from the Internet. The primary responsibilities include deploying web crawling technology to collect structured and unstructured data from various sites on a specific schedule, and cleaning, classifying, validating, and unifying that data according to business rules and taxonomy. The role also involves enriching the data with other information and integrating it into existing products and internal business processes.

Responsibilities

  • Assisting in web crawling and data gathering for our largest data product line.
  • Supporting the evaluation, creation, and deployment of web crawling technology.
  • Helping develop machine learning algorithms with a focus on Natural Language Processing to clean, classify, and match gathered data to existing taxonomy.
  • Collaborating with internal business stakeholders to integrate scraped data into existing research processes and proprietary systems.
  • Working cross-departmentally to define metrics, guidelines, and strategies to measure data coverage and its quality.
  • Contributing to a global team in designing and building new products that aggregate and visualize scraped data from various sources.

Qualifications

  • Bachelor's Degree or equivalent in Mathematics, Computer Science, Statistics or Information Management.
  • Experience in data engineering or roles related to data engineering.
  • Demonstrated strong technical knowledge of object-oriented programming in Python.
  • Strong analytic skills related to working with unstructured datasets.
  • SQL knowledge and experience working with relational databases.
  • Proven ability to work independently and ensure completion of tasks accurately and on time.
  • Strong English communication skills in both verbal and written form.
  • Openness to learning new technologies and tools.

Preferred Qualifications

  • 1+ years of experience in machine learning or natural language processing.
  • Experience using technologies including Browse.ai, Python-Scrapy, Octoparse, Beautiful Soup, Mozenda, NLTK, and PostgreSQL/Snowflake.

Perks & Benefits

  • 5 weeks of holidays + extra corporate day off
  • Sick days
  • Flexibility to work from home most of the week
  • Certain flexibility to schedule your working hours
  • Cafeteria system (use points on Flexipasses, pension/life insurance, or Multisport card)
  • Meal allowance

IDC is an Equal Opportunity Employer. Applicants and employees are considered for positions and are evaluated without regard to mental or physical disability, handicap, race, color, religion, gender, gender identity and expression, ancestry, national origin, age, genetic information, military or veteran status, sexual orientation, marital status or other categories protected by law.

 

Why IDC?

IDC is the most respected global technology market research firm. We are changing the way the world thinks about the impact of technology on business and society. Our people, data, and analytics create global technology insights that accelerate customer success. IDC has been recognized by the IIAR as the Analyst Firm of the Year for five consecutive years (2020, 2021, 2022, 2023, 2024), one of the highest accolades in the technology market research industry.

 

Our collaborative, innovative and entrepreneurial culture is the perfect place for you to discover your future.

 

This position is part-time and is based in our Prague or Ostrava offices, with a hybrid work schedule.

 

Recruitment Fraud Notice: IDG/IDC would like to inform you that we conduct our formal communications via corporate email, our Applicant Tracking System iCIMS, LinkedIn messaging, or directly by phone. We do not use any other platform (including Telegram, WhatsApp, Signal, text, instant message, etc.) to communicate with prospective candidates. If you receive any communication outside of our formal communications channels, please ignore it and block the sender or caller. In addition, we do not ask candidates to provide sensitive personally identifiable information such as bank account or social security numbers. If you have been contacted by someone claiming to represent a job offer, please report it as potential job fraud to law enforcement.

Average salary estimate

$45,000 / year (est.)
Min: $30,000
Max: $60,000

If an employer mentions a salary or salary range on their job, we display it as an "Employer Estimate". If a job has no salary data, Rise displays an estimate if available.

Employment type: Part-time, hybrid
Date posted: April 15, 2025
