Web Scraping / Data Extraction Specialist (Login-Based Platform)

Please login or register as jobseeker to apply for this job.

TYPE OF WORK

Part Time

SALARY

TBA

HOURS PER WEEK

TBD

DATE UPDATED

Apr 6, 2026

JOB OVERVIEW

Hey,
Looking for someone sharp who can help extract data from a web platform.
At the ---------- nt, I’m running into an issue where:
• The platform shows hundreds of results (500+)
• But I can only access or copy around 20–30 records at a time
• There are multiple pages, but data isn’t easily exportable
I need someone who can:
• Extract all available data, not just what’s visible on screen
• Work with login-based platforms (authenticated sessions)
• Handle pagination / lazy loading / dynamic content
• Deliver clean data into Excel or Google Sheets
________________________________________
What you’ll likely be working with:
• JavaScript-heavy web apps
• Infinite scroll or paginated results
• Browser DevTools / Network tab
• API extraction (if available)
• Or DOM scraping if needed
________________________________________
Ideal experience:
• Web scraping tools (Python, Selenium, Puppeteer, etc.)
• Experience extracting from platforms that don’t allow easy export
• Understanding of XHR / API calls / JSON responses
• Able to work efficiently without breaking the platform
________________________________________
Deliverables:
• Full dataset (not partial pages)
• Clean, structured format (CSV / Excel)
• Ideally repeatable process if this becomes ongoing
________________________________________
Notes:
• This is not a basic copy-paste job
• Looking for someone who understands how to get around data limitations properly
• If you’ve done similar work before, mention it briefly

SKILL REQUIREMENT
VIEW OTHER JOB POSTS FROM:
SHARE THIS POST
facebook linkedin
  BENCHMARKS  
Loading Time: Base Classes  0.0012
Controller Execution Time ( Jobseekers / Job )  0.0163
Total Execution Time  0.0183
  GET DATA  
No GET data exists
  MEMORY USAGE  
1,494,136 bytes
  POST DATA  
No POST data exists
  URI STRING  
jobseekers/job/Web-Scraping-Data-Extraction-Specialist-Login-Based-Platform-1618120
  CLASS/METHOD  
jobseekers/job
  DATABASE:  onlinejobs (Jobseekers:$db)   QUERIES: 13 (0.0087 seconds)  (Hide)
0.0009   SELECT *
                                
FROM exrates
                                WHERE rate_name 
'USD-PHP' 
0.0003   SELECT *
FROM `employer_jobs`
WHERE `job_id` = 1618120
 LIMIT 1 
0.0010   SELECT *
FROM `employers`
WHERE `employer_id` = 802369
 LIMIT 1 
0.0008   SELECT COUNT(*) AS `numrows`
FROM `t_thread` `t`
LEFT JOIN `t_thread_misc` `miscON `t`.`id` = `misc`.`thread_id`
WHERE `t`.`job_id` = 1618120
AND `misc`.`idIS NULL 
0.0004   SELECT e.business_namee.logoe.websitee.rebill_datee.date_added member_datehitsDATEDIFF('2026-04-21',ej.date_added) duration_daysDATEDIFF('2026-04-21',e.rebill_date) duration_rebillej.*, e.deactivate FROM employers eemployer_jobs ej WHERE e.employer_id ej.employer_id AND
                                   ((
e.user_level >= '500' AND ej.date_added <= e.rebill_date)
                                   OR 
e.employer_id '' OR (ej.date_approved <> '2000-01-01' and DATEDIFF('2026-04-21',ej.date_added) <= 14 ))
                                   AND 
e.deactivate != AND ej.deleted AND job_id '1618120' 
0.0003   SELECT *
FROM `employer_jobs_skills` `ejs`
LEFT JOIN `skills_categories` `scON `ejs`.`skill_id` = `sc`.`id`
WHERE `job_id` = 1618120 
0.0016   UPDATE employer_jobs SET hit_counts '***Apr-06-2026=189***Apr-07-2026=218***Apr-08-2026=53***Apr-09-2026=36***Apr-10-2026=21***Apr-11-2026=19***Apr-12-2026=17***Apr-13-2026=13***Apr-14-2026=5***Apr-15-2026=4***Apr-16-2026=4***Apr-17-2026=9***Apr-18-2026=8***Apr-21-2026=1' WHERE job_id'1618120'  
0.0006   UPDATE employer_jobs SET monthly_hits '***Apr-2026=597' WHERE job_id'1618120'  
0.0009   SELECT date_sent FROM jobseeker_sent_emails WHERE jobseeker_id '' AND job_id '1618120' AND status LIKE 'sent%' ORDER BY id DESC  
0.0002   SELECT *
FROM `employer_jobs_skills` `ejs`
LEFT JOIN `skills_categories` `scON `ejs`.`skill_id` = `sc`.`id`
WHERE `job_id` = 1618120 
0.0013   SELECT COUNT(*) AS `numrows`
FROM `employer_jobs`
WHERE `employer_id` = '802369'
AND `date_added` >= '2022-06-08' 
0.0002   select from teasers 
0.0002   SELECT FROM skill_categories WHERE skill_cat_id='' 
  HTTP HEADERS  (Show)
  SESSION DATA  (Show)
  CONFIG VARIABLES  (Show)