Skip to content

FSU Research Computing Center Documentation

tesseract

FSU Research Computing Center Documentation

Basics and General
Basics and General
High Performance Computing
High Performance Computing
- HPC Overview
- Using the HPC
  Using the HPC
- Open OnDemand
  Open OnDemand
- Other HPC Information
  Other HPC Information
Storage and Data
Storage and Data
Data Center Services
Data Center Services
Software Catalog
Software Catalog
- Software List
- Contributing
- Python
  Python
- Libraries
  Libraries
  - ARPACK
  - ANTS
  - Armadillo C++
  - BLAS
  - CmdSTAN
  - CUDA
  - HDF4
  - HDF5
  - h5py
  - LAPACK
  - MVAPICH
  - OpenBLAS
  - OpenCV
  - NetCDF
  - nbo7
  - OpenMPI
  - ScaLAPACK
- Compilers
  Compilers
- Applications and Tools
  Applications and Tools
  - ABINIT
  - ABySS
  - Agisoft Metashape
  - Apache Spark
  - Apptainer
  - Atsas
  - bcl2fastq
  - BLAT
  - bedtools
  - BEST
  - Bowtie 2
  - BWA
  - CDO
  - CellRanger
  - CmdSTAN
  - dcraw
  - Desmond
  - DosBox
  - DSuite
  - FASTA
  - FastQC
  - FastX-Toolkit
  - FFmpeg
  - FFTW
  - FMRIPrep
  - FPLO
  - FreeSurfer
  - FSL
  - Gaussian
  - gemBS-rs
  - GULP
  - Grace
  - GROMACS
  - GSL
  - JAGS
  - Julia
  - LAAMPS
  - MAFFT
  - MATLAB
  - Meshroom
  - migrate-n
  - MUMmer
  - NAMD
  - Ncview
  - NWChem
  - NiftyReg
  - OpenRefine
  - ORCA
  - P4VASP
  - ParaView
  - Picard
  - pkg-config
  - R (statistical software)
  - RELION
  - Samtools
  - SPSS
  - STAR
  - Stata
  - tesseract tesseract
    Table of contents
    
    Running tesseract on RCC Resources
  - TopHat
  - Trimmomatic
  - VASP
  - Wannier90
  - WGrib
  - VisIt
  - VMD
HPC Drivers Ed
HPC Drivers Ed
- HPC Driver's Ed
- Course Modules
- Module 1 - Intro to HPC
  Module 1 - Intro to HPC
- Module 2 (Track One) - SSH/Terminal
  Module 2 (Track One) - SSH/Terminal
- Module 3 (Track Two) - Open OnDemand
  Module 3 (Track Two) - Open OnDemand
- Module 4 - Python on the HPC
  Module 4 - Python on the HPC
- Module 5 - MATLAB on the HPC
  Module 5 - MATLAB on the HPC
  - MATLAB on HPC
  - Toolbox Installation
- Module 6 - R on the HPC
  Module 6 - R on the HPC
  - R on HPC
  - Package Installation
- Module 7 - Troubleshooting
  Module 7 - Troubleshooting
- Final Certification

tesseract

An open source optical character recognition (OCR) platform.

Homepage Version(s): 4.1.0

Tesseract is an open source optical character recognition (OCR) platform. OCR extracts text from images and documents without a text layer and outputs the document into a new searchable text file, PDF, or most other popular formats. Tesseract is highly customizable and can operate using most languages, including multilingual documents and vertical text.

Running tesseract on RCC Resources#

To run tesseract on the HPC, you can directly run the command from the terminal as it does not require loading an environment module. In the example below, simply replace imagename and outputbase with your filenames.

1	`$ tesseract imagename outputbase [options...] [configfile...]`

The options and config file content are all listed out on the GitHub page.