Programming Tips

A quick programming tip for your spare time:
How to extract text from an image in 5 lines of code

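Extracting text from an image is usually called OCR (Optical Character Recognition). In Python, one of the most commonly used tools for OCR is pytesseract (a Python wrapper for Tesseract OCR), often paired with image-processing libraries such as OpenCV.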

Prerequisites

Install the Tesseract OCR engine:
Windows users can download the installer from ⬇️
UB-Mannheim/tesseract

macOS users can install it with Homebrew:
Open a terminal and run: brew install tesseract

Install pytesseract and the other dependencies:
pip install pytesseract
pip install Pillow  # basic image handling
pip install opencv-python  # optional, for image preprocessing
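
To give a rough idea of what the optional OpenCV preprocessing can look like, here is a minimal sketch that converts an image to grayscale and binarizes it with Otsu's threshold before OCR. Whether this actually helps depends on the image. The helper name preprocess_for_ocr is hypothetical, and the choice of these two steps is an assumption made for illustration, not a fixed recipe.

```python
def preprocess_for_ocr(image_path):
    """Load an image and return a black-and-white version for OCR."""
    import cv2  # imported here so this file loads even without opencv-python

    img = cv2.imread(image_path)  # returns None if the file cannot be read
    if img is None:
        raise FileNotFoundError(f"Could not read image: {image_path}")
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    # With THRESH_OTSU the threshold is chosen automatically,
    # so the 0 passed as the threshold value is ignored.
    _, binary = cv2.threshold(gray, 0, 255, cv2.THRESH_BINARY | cv2.THRESH_OTSU)
    return binary
```

The returned array can be passed directly to pytesseract.image_to_string, which accepts NumPy arrays as well as Pillow images.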

A minimal workflow for extracting text from an image ⬇️
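
A minimal sketch of such a flow, assuming Tesseract, pytesseract, and Pillow are installed as described above. It is one common way to do it rather than the only one, and extract_text is a placeholder name chosen for illustration.

```python
def extract_text(image_path, lang="eng"):
    """Return the text Tesseract recognizes in the image at image_path."""
    # Imported inside the function so this file still loads
    # even if the packages have not been installed yet.
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as img:
        # lang selects the trained data to use, e.g. "eng" or "chi_sim"
        return pytesseract.image_to_string(img, lang=lang)
```

Calling print(extract_text("example.png")) prints whatever text was recognized, where example.png stands in for your own image file.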
Note: if Chinese text is recognized incorrectly, you may need to install the Simplified Chinese trained data file, chi_sim.traineddata.

macOS ⬇️
brew install tesseract-lang

Otherwise, download it from the official repository ⬇️
tesseract-ocr/tessdata
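
Once the language data is in place, the language is selected through the lang argument. The sketch below checks which languages the local Tesseract reports and falls back to English when chi_sim is missing. pick_language and ocr_chinese are hypothetical helper names, and the fallback behaviour is an assumption made for illustration.

```python
def pick_language(available, wanted="chi_sim", fallback="eng"):
    """Return wanted if it is in the installed languages, else fallback."""
    return wanted if wanted in available else fallback


def ocr_chinese(image_path):
    """Run OCR with chi_sim when available, otherwise with English."""
    # Imported here so this file loads even before the packages are installed.
    from PIL import Image
    import pytesseract

    # get_languages lists the trained data files Tesseract can find.
    lang = pick_language(pytesseract.get_languages())
    with Image.open(image_path) as img:
        return pytesseract.image_to_string(img, lang=lang)
```

For images that mix Chinese and English, Tesseract also accepts combined codes such as lang="chi_sim+eng".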

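On Windows, if you changed the default install path when installing the Tesseract OCR engine, you need to add that folder to the PATH environment variable manually, and also set pytesseract.pytesseract.tesseract_cmd in your code to the full path of tesseract.exe in that folder.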
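
A minimal sketch of that configuration step: it looks for the executable at a custom location first and otherwise falls back to whatever is found on PATH. find_tesseract and configure_pytesseract are hypothetical helper names, not part of pytesseract.

```python
import os
import shutil


def find_tesseract(custom_path=None):
    """Return the path to a tesseract executable, or None if not found."""
    if custom_path and os.path.isfile(custom_path):
        return custom_path
    # shutil.which searches the directories listed in PATH.
    return shutil.which("tesseract")


def configure_pytesseract(custom_path=None):
    """Point pytesseract at the executable and return the path used."""
    import pytesseract  # imported here so this file loads without it

    path = find_tesseract(custom_path)
    if path is None:
        raise FileNotFoundError("tesseract executable not found")
    pytesseract.pytesseract.tesseract_cmd = path
    return path
```

A call might look like configure_pytesseract(r"C:\Program Files\Tesseract-OCR\tesseract.exe"), where the path is only an example and should be replaced with your actual install location.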