Supported Formats & Languages
Detailed information about file formats, document types, languages, and expected accuracy
Supported File Formats
JPG / JPEG
Most common format
Ideal for scans and photographs of documents. Widely supported and efficient for storage.
PNG
Lossless quality
Best for preserving image quality without compression artifacts. Larger file sizes.
TIFF
Archival quality
Professional archival format. Excellent for high-quality scans from archives and libraries.
WebP
Modern format
Offers excellent compression with a good balance of quality and file size.
File Requirements
- Maximum file size: 20 MB per image
- Recommended resolution: 300 DPI or higher for best results
- Color mode: RGB or Grayscale (color recommended for faded documents)
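The format and size limits above can be checked locally before uploading. The following is a minimal sketch, not part of the service itself: the function names `preflight_problems` and `check_file` are hypothetical, the extension list (including `.tif`) is an assumption derived from the formats listed above, and it assumes "20 MB" means 20 × 1024 × 1024 bytes.

```python
from pathlib import Path

# Assumption: "20 MB" is read as 20 MiB; adjust if the service counts 20,000,000 bytes.
MAX_BYTES = 20 * 1024 * 1024

# Assumption: common extensions for the four listed formats (JPG/JPEG, PNG, TIFF, WebP).
ALLOWED_EXTENSIONS = {".jpg", ".jpeg", ".png", ".tif", ".tiff", ".webp"}


def preflight_problems(name: str, size_bytes: int) -> list:
    """Return a list of problems found; an empty list means the file looks acceptable."""
    problems = []
    ext = Path(name).suffix.lower()  # case-insensitive, so "SCAN.JPG" passes
    if ext not in ALLOWED_EXTENSIONS:
        problems.append(f"unsupported extension: {ext or '(none)'}")
    if size_bytes > MAX_BYTES:
        problems.append(f"file is {size_bytes} bytes; limit is {MAX_BYTES}")
    return problems


def check_file(path: str) -> list:
    """Run the same checks against a file on disk."""
    p = Path(path)
    return preflight_problems(p.name, p.stat().st_size)
```

This sketch only inspects the file name and size. It does not verify resolution or color mode, which would require an image library.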
Language & Script Support
Our AI has been trained and tested on historical documents in various languages. Below is our honest assessment of accuracy levels based on actual processing results.
Tier 1: Extensively Tested
Highest Confidence
These languages have been thoroughly tested with thousands of documents. Expect reliable results on well-preserved scans.
Polish
Latin cursive script
Best for 19th-20th century civil records. Extensive testing on Masovia region documents.
5,000+ records processed
Latin
Church records
Strong support for Catholic parish registers. Familiar with common abbreviations.
3,000+ records processed
Russian
Cyrillic script
Good support for Russian Empire civil records (1850-1920). Handles Cyrillic cursive well.
2,500+ records processed
German
Gothic / Kurrent script
Good support for German Gothic script. Accuracy depends more heavily on the individual writer's handwriting style.
2,000+ records processed
Tier 2: Good Support
Reliable
Good results on typical documents. Some challenging scripts or unusual handwriting may reduce accuracy.
Ukrainian
80-88% accuracy
Cyrillic script, similar to Russian support
French
82-90% accuracy
Latin cursive, civil records
Italian
80-88% accuracy
Latin cursive
Spanish
80-88% accuracy
Latin cursive
Portuguese
78-86% accuracy
Latin cursive
Czech / Slovak
78-86% accuracy
Latin cursive with diacritics
Tier 3: Experimental / Beta
Variable Results
Important: These languages have limited testing. Results may be inconsistent. We recommend starting with a small batch to evaluate quality before processing large collections.
Hebrew Script
- Hebrew
- Yiddish
60-80% accuracy, highly variable
Other Cyrillic
- Bulgarian
- Serbian
70-85% accuracy
Other Scripts
- Greek
- Armenian
Limited testing, results vary
Known Limitations
Accuracy can be significantly reduced by the following factors:
Document Condition
- Faded or water-damaged ink
- Torn or missing sections
- Heavy foxing or staining
- Bleed-through from reverse side
- Low-resolution scans (below 200 DPI)
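For photographs, which often carry no reliable DPI metadata, the effective resolution can be estimated from the pixel dimensions and the physical size of the page. The sketch below is illustrative only: the function names are hypothetical, and the thresholds simply restate the 200 DPI and 300 DPI figures given on this page.

```python
def effective_dpi(width_px: int, height_px: int,
                  width_in: float, height_in: float) -> float:
    """Estimate resolution as the lower of horizontal and vertical pixels per inch."""
    if width_in <= 0 or height_in <= 0:
        raise ValueError("physical dimensions must be positive")
    return min(width_px / width_in, height_px / height_in)


def resolution_advice(dpi: float) -> str:
    """Map an estimated DPI to the guidance stated in this document."""
    if dpi < 200:
        return "below 200 DPI: accuracy may be significantly reduced"
    if dpi < 300:
        return "usable, but under the recommended 300 DPI"
    return "meets the recommended 300 DPI"


# Example: an 8 x 10 inch page captured at 2400 x 3000 pixels.
print(resolution_advice(effective_dpi(2400, 3000, 8.0, 10.0)))
```

The estimate assumes the page fills the frame. If the image includes wide borders around the document, the true resolution of the text area is lower.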
Handwriting Challenges
- Highly inconsistent handwriting
- Unusual abbreviations or shorthand
- Decorative or elaborate scripts
- Very small or cramped writing
- Multiple overlapping hands
What To Do
If you receive low-confidence results, you can submit corrections, and we will use them to refine our results. For particularly challenging documents, consider our priority tier, which includes manual review options. See our Quality Guarantees page for refund policies.
Supported Document Types
Birth Records
Birth certificates, baptismal records, naming ceremonies
Marriage Records
Marriage certificates, banns, wedding records
Death Records
Death certificates, burial records, obituaries
Church Records
Parish registers, confirmation records, church books
Civil Registration
Government vital records, civil certificates
Other Historical Documents
Census records, wills, property documents
Tips for Best Results
Use high-resolution scans - 300 DPI or higher produces the best results. 600 DPI is ideal for faded documents.
Ensure good lighting - Avoid shadows and uneven illumination during scanning or photography.
Keep documents flat - Minimize curves, folds, and wrinkles when scanning bound volumes.
Include the full page - Don't crop too close to the text. Margins provide helpful context.
Color vs grayscale - Color scans often work better for faded or stained documents.
Test first - For new document types, try a few samples before processing large batches.
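One way to follow the "test first" tip is to draw a small random sample from a collection rather than taking the first few files, which may all come from the same volume or writer. This is a minimal sketch using only the Python standard library; `pick_sample` and the default sample size of 5 are assumptions made for illustration, not part of the service.

```python
import random
from typing import List, Optional


def pick_sample(paths: List[str], k: int = 5,
                seed: Optional[int] = None) -> List[str]:
    """Return up to k randomly chosen paths; pass a seed for a repeatable selection."""
    if len(paths) <= k:
        return list(paths)  # collection is already small enough to test in full
    rng = random.Random(seed)  # local generator, leaves global random state untouched
    return rng.sample(paths, k)


# Example: choose 5 of 50 hypothetical scan files for a trial run.
all_scans = [f"scan_{i:03d}.jpg" for i in range(50)]
trial_batch = pick_sample(all_scans, k=5, seed=1)
print(trial_batch)
```

Reviewing the results from the trial batch before submitting the rest helps catch script or scan-quality problems early.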