Home PHP Software PHP Libraries PDF Parser, Unlock Data from PDF Documents

PDF Parser, Unlock Data from PDF Documents

October 3, 2020

PDF Parsers are very useful tools for data researchers, scientists or even journalists. Lots of data is available today online but locked in PDF files. PDF Parser is a simple PHP library to parse PDF files and extract elements like text. It will be great if it could be parse tables, but since the library is under active development, we hope this will be added to the todo list with secure PDF Documents. Extracting data from PDF tables a very hard and tough task, and actually there is only one open source software that can do it correctly but not automated.

Some features of the PDF Parser :

Load and parse objects and headers
Extract metadata (author, description, keywords, …)
Extract text from ordered pages
Support for compressed pdf (and not)
Support of charset encoding (WinAnsi, MacRoman)
Handling of hexa and octal content encoding
PSR-0 compliant (autoloader)
Compatible with Composer
PSR-1 compliant

Documentation available here, Released under LGPL-v3 license. For more information https://www.pdfparser.org/

Which Boring but Cheap Web Development Stack to Use in 2023?

Introducing Pest 2.0: The Next Generation of PHP Testing

PHP Design Patterns Game : The Singleton Pattern

Introducing the PHP Design Patterns Game Series

A Beginner’s Guide to Business Management for Freelancers

5 Common Mistakes Freelancers Make and How to Avoid Them

Navigating Freelance Contracts: What to Look for and What to Avoid

The Art of Saying ‘No’ to Clients: Setting Boundaries for Freelancers

Getting Started with Bref: Deploying Serverless PHP Applications on AWS Lambda

Unlock the Power of Real-time Intelligence with Redis Enterprise Cloud on…

Centreon, powerful IT and Application monitoring software

MySQL 8.0.26 and 5.7.35 Released

PeachPie, Do We Really Need a .NET Development Platform for PHP…

React Introduce Zero-Bundle-Size React Server Components

Bootstrap 4.6.0 Released with all new backend !

Visx, low-level visualization primitives for React by Airbnb

The Power of Design Sprints: A Modern Approach to Software Development

Optimizing your SQL queries: Understanding the execution order

Introducing MRSK: Your Ticket to Deploying Web Apps Anywhere

The Balancing Act of Web Security and Performance: How to Keep…

PDF Parser, Unlock Data from PDF Documents

Like this:

Related

LEAVE A REPLY Cancel reply

Social Media

Latest articles

Get Started with Laravel Volt: A Free Full Stack Laravel App...

Unleashing the Power of PHP Fibers: Boost Web Development with Efficient...

Which Boring but Cheap Web Development Stack to Use in 2023?

Initializer for Laravel – A Visual Approach to Setting Up a...

PHP Coding Puzzle 10 : Sudoku Game

Laravel-Websockets, WebSocket Server Implemented in PHP