PDF OCR & AI Text Extraction Specialist (Searchable PDFs + AI)

TYPE OF WORK

Gig

SALARY

Negotiable

HOURS PER WEEK

40

DATE UPDATED

Apr 18, 2026

JOB OVERVIEW

We are looking for a highly detail-oriented and reliable virtual assistant to take over our current PDF-to-Word conversion workflow.

**Main Focus:**
The primary purpose of this role is to extract text from PDF documents and deliver it in clean Microsoft Word format so we can easily copy and paste the words into AI tools (such as Grok) whenever needed.

**Exact Daily Tasks:**
- Use any OCR software (Adobe Acrobat, ABBYY, PDFelement, or equivalent) to make PDF documents fully searchable and text-selectable.
- Split the searchable PDFs into small batches of pages.
- Upload each batch to Grok AI (or any AI program) and request text extraction while preserving the original formatting, line spacing, line breaks, and layout.
- Copy the extracted text exactly as Grok returns it (including page labels like **PAGE X**, separators, and structure).
- Paste the final text into a clean Microsoft Word (.docx) document.
- Ensure 100% accuracy: no words can be missed or changed. These are important court/legal documents.
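
For applicants who like to script parts of this routine, the batching and accuracy steps above can be sketched roughly as follows. This is only an illustration, not a required tool or our actual process: the function names `page_batches` and `first_word_mismatch` and the batch size are hypothetical, and the check assumes you already have the PDF's text layer and the AI-returned text as plain strings.

```python
from typing import Iterator, Optional, Tuple


def page_batches(total_pages: int, batch_size: int) -> Iterator[Tuple[int, int]]:
    """Yield inclusive, 1-based (first_page, last_page) ranges covering every page once."""
    if total_pages < 0 or batch_size < 1:
        raise ValueError("total_pages must be >= 0 and batch_size must be >= 1")
    for first in range(1, total_pages + 1, batch_size):
        yield first, min(first + batch_size - 1, total_pages)


def first_word_mismatch(
    source_text: str, extracted_text: str
) -> Optional[Tuple[int, Optional[str], Optional[str]]]:
    """Compare two texts word by word, ignoring whitespace differences.

    Returns None when the word sequences are identical; otherwise returns
    (word_index, source_word, extracted_word) for the first difference.
    A missing word on either side is reported as None.
    """
    source_words = source_text.split()
    extracted_words = extracted_text.split()
    for index, (src, ext) in enumerate(zip(source_words, extracted_words)):
        if src != ext:
            return index, src, ext
    if len(source_words) != len(extracted_words):
        index = min(len(source_words), len(extracted_words))
        src = source_words[index] if index < len(source_words) else None
        ext = extracted_words[index] if index < len(extracted_words) else None
        return index, src, ext
    return None


if __name__ == "__main__":
    # Hypothetical 23-page document split into batches of 10 pages.
    for first, last in page_batches(23, 10):
        print(f"Batch: pages {first}-{last}")
    # A changed word is reported with its position.
    print(first_word_mismatch("the court finds", "the court found"))
```

A check like this only flags missing or altered words. It does not verify formatting, line breaks, or layout, so a manual read-through against the original PDF is still needed.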

**Requirements:**
- Experience using OCR software to convert scanned/image PDFs into fully searchable PDFs.
- Comfortable working with Grok AI (or similar AI chat tools) and following very specific formatting instructions.
- Extremely careful and patient with repetitive, high-accuracy work.
- Able to deliver clean, perfectly formatted Word files every time, ready for easy copy-paste into AI tools.

This is precise, detail-heavy work. Speed is good, but **accuracy is critical** — we cannot have any missing or altered text.

If you are detail-obsessed and can follow instructions exactly, this is a great long-term position.
