
Smart Parsing

Automatically recognize and extract the content of an existing resume and quickly convert it to an editable format. Multiple file formats are supported.

Updated: 1/15/2024

SmartCV's smart parsing feature uses AI to recognize and extract the content of an existing resume. It converts resume files in a variety of formats into editable, structured data, which greatly simplifies the resume creation process.

Overview

Using deep learning algorithms, the smart parsing feature provides:

  • Format Recognition: Supports multiple file formats, such as PDF, Word, and images
  • Content Extraction: Identifies the sections and individual pieces of information in a resume
  • Structured Conversion: Converts unstructured content to a standard format
  • Precise Matching: Maps the extracted content to SmartCV's resume structure

Supported File Formats

Main Formats

📄 PDF Files

Supports both editable and scanned PDFs

  • Editable PDF (recommended)
  • Scanned PDF
  • Image-based PDF
  • Encrypted PDF

📝 Word Documents

Supports Word and related document formats

  • .docx (recommended)
  • .doc
  • .rtf
  • .odt

🖼️ Image Formats

Text in images is recognized through OCR

  • .jpg / .jpeg
  • .png
  • .bmp
  • .tiff

📊 Other Formats

Other common formats are also supported

  • .txt
  • .html
  • LinkedIn export
  • Online resume links
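
As a rough illustration of the format list above, a client could check a file's extension before uploading it. This is only a sketch: the `SUPPORTED_FORMATS` table, the category names, the `mayNeedOcr` flag, and the `classifyFile` helper are all hypothetical and are not part of SmartCV's documented behavior. LinkedIn exports and online resume links are left out because they are not identified by a file extension.

```javascript
// Hypothetical lookup table built from the format list above.
// The category names and the OCR flag are illustrative assumptions,
// not part of SmartCV's documented behavior.
const SUPPORTED_FORMATS = new Map([
  ['pdf', { category: 'pdf', mayNeedOcr: true }], // scanned PDFs may lack a text layer
  ['docx', { category: 'word', mayNeedOcr: false }],
  ['doc', { category: 'word', mayNeedOcr: false }],
  ['rtf', { category: 'word', mayNeedOcr: false }],
  ['odt', { category: 'word', mayNeedOcr: false }],
  ['jpg', { category: 'image', mayNeedOcr: true }],
  ['jpeg', { category: 'image', mayNeedOcr: true }],
  ['png', { category: 'image', mayNeedOcr: true }],
  ['bmp', { category: 'image', mayNeedOcr: true }],
  ['tiff', { category: 'image', mayNeedOcr: true }],
  ['txt', { category: 'other', mayNeedOcr: false }],
  ['html', { category: 'other', mayNeedOcr: false }]
])

// Returns format details for a file name, or null if the extension is not listed.
function classifyFile(fileName) {
  const dot = fileName.lastIndexOf('.')
  if (dot <= 0 || dot === fileName.length - 1) return null
  const extension = fileName.slice(dot + 1).toLowerCase()
  const info = SUPPORTED_FORMATS.get(extension)
  return info ? { extension, ...info } : null
}
```

A `Map` is used rather than a plain object so that an extension such as `constructor` cannot accidentally match an inherited property.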

Usage Steps

  1. Enter Upload Page

    Click the "Upload Resume" or "Smart Parsing" button from the homepage or dashboard to enter the file upload interface.

  2. Select Upload Method

    The system provides multiple upload methods:

    • Drag and Drop Upload: Drag files directly onto the upload area
    • Click Selection: Click the button to select files from your device
    • Link Import: Enter the URL of an online resume
    • Scan Upload: Use your phone camera to capture and upload a paper resume

  3. Start Parsing

    After file upload is complete, click the "Start Parsing" button. The system will:

    1. Verify file format and size
    2. Perform OCR text recognition (if needed)
    3. Analyze document structure and layout
    4. Extract personal information and content
    5. Classify and match the extracted content

  4. View Parsing Results

    After parsing is complete, you will see:

    • Parsing Preview: A preview of the extracted content
    • Accuracy Score: A score that rates the parsing quality
    • Recognized Fields: The fields that were identified successfully
    • Problem Prompts: Content that requires manual confirmation

  5. Confirm and Edit

    On the parsing results page:

    • Item-by-item Confirmation: Check the accuracy of each field
    • Manual Adjustment: Correct content that was recognized incorrectly
    • Supplement Information: Add important information that was not recognized
    • Create Resume: Create a new resume once everything is confirmed

  6. Save to Editor

    After you click "Create Resume", the parsed content automatically fills the resume editor, where you can:

    • Select an appropriate template
    • Further edit and optimize the content
    • Adjust the format and layout
    • Use the AI optimization feature to enhance the content
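
To make the "View Parsing Results" step more concrete, the sketch below shows one way a result could be split into recognized fields and fields that need manual confirmation. Everything here is assumed for illustration: the result shape (a `value` and a `confidence` per field), the 0.8 threshold, the simple percentage score, and the `summarizeParseResult` name. SmartCV's actual scoring method is not described in this document.

```javascript
// Hypothetical result shape: each field has a value and a confidence in [0, 1].
// Neither the shape nor the 0.8 threshold comes from SmartCV; both are assumptions.
function summarizeParseResult(fields, threshold = 0.8) {
  const recognized = []
  const needsConfirmation = []

  for (const [name, field] of Object.entries(fields)) {
    const hasValue =
      field.value !== null && field.value !== undefined && field.value !== ''
    if (hasValue && field.confidence >= threshold) {
      recognized.push(name)
    } else {
      // Empty or low-confidence fields are flagged for manual review.
      needsConfirmation.push(name)
    }
  }

  const total = recognized.length + needsConfirmation.length
  // Illustrative score: the share of fields that passed the threshold.
  const score = total === 0 ? 0 : Math.round((recognized.length / total) * 100)
  return { score, recognized, needsConfirmation }
}
```

Calling `summarizeParseResult` with the parsed fields returns the two lists plus the score, which a review page could use to decide which fields to highlight.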

Parsing Capabilities in Detail

1. Personal Information Recognition

Smart parsing can accurately recognize and extract:

Basic Information

  • Name: Chinese and English names, including compound names
  • Contact Information: Mobile and landline phone numbers
  • Email Address: Email addresses in a variety of formats
  • Address Information: Detailed address, city, and postal code

Online Information

  • LinkedIn Profile: Recognize LinkedIn profile links
  • Personal Website: Portfolio, blog and other personal websites
  • Social Media: Relevant professional social media accounts
  • GitHub Links: Code repositories for technical personnel
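
As a simplified illustration of this kind of extraction, the sketch below pulls an email address and profile links out of plain text with regular expressions. It does not describe SmartCV's parser, which is described above as AI-based; the patterns, the `PATTERNS` table, and the `extractContactInfo` name are assumptions, and phone numbers are omitted because their formats vary widely by country.

```javascript
// Illustrative patterns only; they cover common cases, not every valid format.
const PATTERNS = {
  email: /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/,
  linkedin: /(?:https?:\/\/)?(?:www\.)?linkedin\.com\/in\/[A-Za-z0-9_-]+/i,
  github: /(?:https?:\/\/)?(?:www\.)?github\.com\/[A-Za-z0-9-]+/i
}

// Returns the first match for each pattern, or null when nothing matches.
function extractContactInfo(text) {
  const result = {}
  for (const [field, pattern] of Object.entries(PATTERNS)) {
    const match = text.match(pattern)
    result[field] = match ? match[0] : null
  }
  return result
}
```

Fields that come back as `null` are natural candidates for the manual confirmation step.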

2. Work Experience Extraction

Recognition Capabilities

  • Company Information: Company name, size, and industry
  • Position Titles: Job titles in their many variants
  • Time Period: Start and end dates
  • Job Description: Responsibilities and main achievements

Intelligent Processing

  • Time Parsing: Recognizes dates written in a variety of formats
  • Responsibility Separation: Automatically separates distinct work responsibilities
  • Achievement Extraction: Recognizes quantified work results
  • Skill Association: Extracts relevant skills from descriptions
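
To show what time parsing involves, here is a minimal sketch that normalizes a few common ways of writing an employment period into `YYYY-MM` values. The set of accepted formats, the function names, and the output shape are assumptions chosen for illustration; the formats SmartCV actually recognizes are not listed in this document.

```javascript
const MONTHS = ['jan', 'feb', 'mar', 'apr', 'may', 'jun',
  'jul', 'aug', 'sep', 'oct', 'nov', 'dec']

// Formats a year and a 1-based month as "YYYY-MM", or returns null if invalid.
function toYearMonth(year, month) {
  if (month < 1 || month > 12) return null
  return `${year}-${String(month).padStart(2, '0')}`
}

// Converts one date expression to "YYYY-MM", or null if it is not recognized.
function parseYearMonth(text) {
  const s = text.trim().toLowerCase()
  let m
  if ((m = s.match(/^(\d{4})[.\/-](\d{1,2})$/))) { // 2019.03, 2019/3, 2019-03
    return toYearMonth(m[1], Number(m[2]))
  }
  if ((m = s.match(/^(\d{1,2})[.\/-](\d{4})$/))) { // 03/2019
    return toYearMonth(m[2], Number(m[1]))
  }
  if ((m = s.match(/^([a-z]{3})[a-z]*\.?\s+(\d{4})$/))) { // Mar 2019, March 2019
    const index = MONTHS.indexOf(m[1])
    return index === -1 ? null : toYearMonth(m[2], index + 1)
  }
  return null
}

// Separators: en dash, em dash, or tilde (spaces optional); hyphen or "to" (spaces required).
const PERIOD_SEPARATOR = /\s*[\u2013\u2014~]\s*|\s+(?:-|to)\s+/i

// Parses "start - end" into { start, end, isCurrent }, or null if unrecognized.
function parsePeriod(text) {
  const parts = text.trim().split(PERIOD_SEPARATOR)
  if (parts.length !== 2) return null
  const start = parseYearMonth(parts[0])
  const isCurrent = /^(present|now|current|至今)$/i.test(parts[1].trim())
  const end = isCurrent ? null : parseYearMonth(parts[1])
  if (start === null || (!isCurrent && end === null)) return null
  return { start, end, isCurrent }
}
```

A plain hyphen only counts as a separator when it has spaces on both sides, so that dates such as `2019-03` are not split in the middle.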

3. Education Background Analysis

Education Information

  • School Name: Educational institutions of all kinds
  • Major Information: Accurately matched disciplines and majors
  • Degree Level: Bachelor's, Master's, PhD, etc.
  • Graduation Time: Enrollment and graduation dates

Academic Achievements

  • GPA Scores: Extraction of academic scores
  • Honors and Awards: Scholarships, honorary titles
  • Course Projects: Important courses and project experiences
  • Academic Activities: Student union, club activities

4. Skills and Abilities Recognition

🔧 Technical Skills
  • Programming languages
  • Development frameworks
  • Database technologies
  • Software tools

🗣️ Language Skills
  • Languages
  • Proficiency levels
  • Language certificates
  • Application scenarios

🎯 Professional Skills
  • Industry-specific skills
  • Certifications and qualifications
  • Professional tools
  • Practical experience

🤝 Soft Skills
  • Communication skills
  • Leadership
  • Team collaboration
  • Problem solving

Parsing Quality Optimization

Improve Parsing Accuracy

For the best parsing results, follow these recommendations:

File Quality Optimization

📄 Best file conditions:

  • Use editable PDF or Word format
  • Make sure the text is clear and readable, not blurry
  • Use standard fonts and avoid decorative ones
  • Keep the page layout reasonable and avoid overly complex designs

📷 Image upload suggestions:

  • Make sure the lighting is sufficient and the text is clear
  • Avoid tilted or distorted shots
  • Use high-resolution images
  • Ensure sufficient contrast between background and text

Content Structure Optimization

  • Clear Sections: Use clear section headings
  • Complete Information: Ensure important information is complete and accurate
  • Consistent Formatting: Keep the format of dates, contact information, and similar items consistent
  • Avoid Tables: Complex tables may affect parsing results

Common Parsing Issues

Problem Types and Solutions

Problem Type              | Possible Causes                    | Solutions
Name recognition error    | Special characters or formats      | Manual confirmation and correction
Date format confusion     | Mixed use of multiple date formats | Re-parse after unifying date format
Job description loss      | Complex layout                     | Use simplified document format
Contact information error | OCR recognition issues             | Verify and manually correct

Advanced Features

Batch Parsing

For enterprise users, or whenever multiple resumes need to be processed:

  1. Batch Upload

    Upload multiple resume files at once; the system processes them in order.

  2. Queue Processing

    Files enter the parsing queue and are processed in sequence. You can view the processing progress.

  3. Result Management

    All parsing results are saved to your account, where you can manage and edit them later.
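
The in-order queue behavior described above can be sketched on the client side as follows. This is an assumption-laden illustration, not SmartCV's implementation: `processQueue`, the `parseFn` callback, the `onProgress` callback, and the result shape are all hypothetical names introduced here.

```javascript
// Processes files one at a time, in upload order, and reports progress after each.
// `parseFn` stands in for whatever performs the parsing (hypothetical here).
async function processQueue(files, parseFn, onProgress = () => {}) {
  const results = []
  for (let i = 0; i < files.length; i++) {
    const file = files[i]
    try {
      const data = await parseFn(file)
      results.push({ file, status: 'done', data })
    } catch (error) {
      // A failed file is recorded and does not block the rest of the queue.
      const message = error instanceof Error ? error.message : String(error)
      results.push({ file, status: 'failed', error: message })
    }
    onProgress({ completed: i + 1, total: files.length })
  }
  return results
}
```

Awaiting each file before starting the next keeps the results in upload order, at the cost of throughput. A real service might process several files in parallel instead.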

Incremental Updates

For updates to existing resumes:

🔄 Smart Merge
The system recognizes the differences between the old and new versions of a resume and merges new work experience, skills, and other information, so you do not have to enter anything twice.
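
One simple way to think about such a merge is to append only the entries that the existing resume does not already contain. The sketch below assumes a hypothetical resume shape with `experience` and `skills` arrays, and it treats two work entries as the same when company, title, and start date match after normalization. SmartCV's actual matching rules are not documented here and are likely more sophisticated.

```javascript
// Builds a comparison key for a work-experience entry.
// Matching on company + title + start date is an assumption for illustration.
function experienceKey(entry) {
  return [entry.company, entry.title, entry.start]
    .map((part) => String(part ?? '').trim().toLowerCase())
    .join('|')
}

// Returns existing items followed by incoming items whose key is not yet present.
function appendNew(existingItems, incomingItems, keyOf) {
  const seen = new Set(existingItems.map(keyOf))
  const merged = [...existingItems]
  for (const item of incomingItems) {
    const key = keyOf(item)
    if (!seen.has(key)) {
      seen.add(key)
      merged.push(item)
    }
  }
  return merged
}

// Merges a newly parsed resume into an existing one without duplicating entries.
function mergeResumes(existing, incoming) {
  return {
    ...existing,
    experience: appendNew(existing.experience, incoming.experience, experienceKey),
    skills: appendNew(existing.skills, incoming.skills, (s) => s.trim().toLowerCase())
  }
}
```

The merge returns a new object and leaves both inputs unchanged, which makes it easy to show the user a before-and-after comparison prior to saving.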

API Integration

Developers can integrate the parsing functionality through the API:

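// Resume parsing API example
// Uploads a resume file as multipart form data and returns the parsed JSON.
const parseResume = async (file) => {
  const formData = new FormData()
  formData.append('resume', file)

  const response = await fetch('/api/resume-parser', {
    method: 'POST',
    body: formData
  })

  // Surface HTTP errors instead of trying to read an error page as JSON
  if (!response.ok) {
    throw new Error(`Resume parsing failed with status ${response.status}`)
  }

  return response.json()
}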

Best Practices

Pre-parsing Preparation

  1. File Check: Ensure files are complete, clear, and in correct format
  2. Content Organization: Confirm completeness and accuracy of resume content
  3. Format Simplification: If possible, use simpler layout formats
  4. Information Verification: Ensure all information is what you want to keep

Post-parsing Optimization

  1. Item-by-item Verification: Carefully check the accuracy of each parsed field
  2. Information Supplement: Add information that may have been missed during parsing
  3. Format Adjustment: Optimize the presentation format and structure of information
  4. Content Enhancement: Use AI optimization features to further improve content quality

Long-term Maintenance

  • Regular Updates: Update your resume content promptly so that it stays current
  • Version Management: Save multiple versions for comparison and selection
  • Backup Storage: Regularly back up important resume versions
  • Continuous Optimization: Keep refining your resume content based on job search feedback

Common Questions

Q: What is the parsing accuracy rate?

A: For resumes in a standard format, accuracy is usually between 85% and 95%. Resumes with complex designs may require more manual confirmation.

Q: Are non-English resumes supported?

A: Yes, the system supports resume parsing in multiple languages including Chinese and English, and can handle bilingual resumes.

Q: What should I do if parsing fails?

A: If parsing fails, you can: 1) try a different file format; 2) simplify the document layout; 3) contact customer service for help.

Q: Is the parsed data secure?

A: All uploaded files are encrypted, and the original files will be automatically deleted after parsing is complete, ensuring your privacy and security.

Q: Can resumes in other languages be parsed?

A: At present, the system mainly supports Chinese and English resumes. Support for other languages is under development.

Update History

  • v2.3.0: Added batch parsing to support enterprise users
  • v2.2.0: Optimized the OCR recognition algorithm and improved image parsing accuracy
  • v2.1.0: Added incremental updates with smart merging of resume content
  • v2.0.0: Fully upgraded the parsing engine and added support for more file formats
  • v1.9.0: Enhanced multi-language support and improved Chinese resume parsing