Python Developer (Scrapy + Zyte)


TYPE OF WORK

Gig

WAGE / SALARY

100000

HOURS PER WEEK

8

DATE UPDATED

May 25, 2025

JOB OVERVIEW

We are seeking a Senior Python Developer with expertise in Scrapy and API-based data extraction to build scalable, reliable scrapers deployed on Zyte Scrapy Cloud. You will primarily utilise websites' APIs to ensure efficiency and scalability, while managing data pipelines and ensuring compliance with ethical and legal standards.

Key Responsibilities
- Develop and deploy Scrapy spiders on Zyte Scrapy Cloud with a focus on API-based data extraction.
- Handle API challenges like rate limiting, pagination, and authentication (e.g., API keys, OAuth).
- Optimise spiders for performance, scalability, and minimal API/server load.
- Use Zyte Smart Proxy Manager for IP rotation and session handling as needed.
- Build and maintain pipelines for cleaning, validating, and storing data in cloud databases.
- Monitor scraper performance and resolve issues using Zyte Scrapy Cloud’s tools.
- Manage the ongoing ingestion of data into our data pipelines.
- Ensure compliance with terms of service and ethical scraping practices.
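
To give candidates a feel for the API work described above, here is a minimal sketch of paginating through an API while backing off on rate limits. It is only an illustration under stated assumptions, not our production code: the `fetch` callable, the `items` and `next_cursor` field names, and the retry settings are all hypothetical, and a real spider would issue these requests through Scrapy rather than a plain function.

```python
import time
from typing import Callable, Dict, Iterator, Optional, Tuple

# Hypothetical fetch signature: takes a cursor (None for the first page) and
# returns (status_code, payload). A real spider would use Scrapy requests.
Fetch = Callable[[Optional[str]], Tuple[int, Dict]]


def iter_items(
    fetch: Fetch,
    max_retries: int = 3,
    base_delay: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Iterator[Dict]:
    """Yield items from a cursor-paginated API, backing off on HTTP 429."""
    cursor: Optional[str] = None
    while True:
        for attempt in range(max_retries + 1):
            status, payload = fetch(cursor)
            if status != 429:
                break
            if attempt == max_retries:
                raise RuntimeError("rate limit retries exhausted")
            # Exponential backoff: base_delay, 2x, 4x, ... between retries.
            sleep(base_delay * 2 ** attempt)
        if status != 200:
            raise RuntimeError(f"unexpected status {status}")
        # "items" and "next_cursor" are assumed field names for this sketch.
        yield from payload.get("items", [])
        cursor = payload.get("next_cursor")
        if not cursor:
            return
```

Passing `sleep` in as a parameter keeps the backoff easy to exercise in tests without real delays; the same idea carries over to honouring a server-provided retry interval where the API supplies one.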

Requirements
- 5+ years of Python development experience, including 3+ years with Scrapy.
- Proven expertise in API integration (REST, GraphQL) and in handling challenges like rate limiting.
- Experience deploying and monitoring spiders on Zyte Scrapy Cloud.
- Proficiency in working with databases (e.g., PostgreSQL) and optimising Scrapy pipelines.
- Familiarity with Zyte Smart Proxy Manager and handling anti-scraping techniques.
- Strong understanding of HTTP, JSON, and web technologies.

Preferred Skills
- Experience with GraphQL APIs and Zyte AutoExtract API.
- Familiarity with JavaScript-heavy page rendering (e.g., Splash, Playwright).
- Knowledge of distributed scraping or big data tools (e.g., Scrapy-Cluster, Airflow).

Benefits
- Competitive salary and bonuses.
- Flexible working hours and remote-friendly environment.
- Opportunity to work on innovative, large-scale data extraction projects.

SKILL REQUIREMENT

Python, Data Scraping, Web Scraping
