33 projects
The Symfony PHP Framework
Symfony is a PHP web application framework designed for building robust, scalable, and maintainable web applications using reusable components and a structured MVC architecture. It's widely used for enterprise-level projects, and its components underpin many other PHP platforms, including Laravel and Drupal.
16,949
3,344
$66M
Servo Project
The mission of the Project is to provide an independent, modular, embeddable web engine that allows developers to deliver content and applications using web standards. NOTE: Servo Project was originally set up as a Series LLC (and under the Servo Project Fund). Both were archived on June 15, 2023, and Servo Project transitioned to being an LF Europe Project, with its technical charter at https://github.com/servo/project/blob/main/governance/CHARTER.md.
8,407
1,785
$1.4B
Nokogiri
Nokogiri is a Ruby library for parsing and manipulating HTML and XML documents, with both DOM and SAX-style parsers. It provides a robust API for reading, searching, modifying, and extracting data from structured documents using XPath and CSS selectors.
2,706
985
$4.4M
Cheerio
Cheerio is a fast, flexible, and lean implementation of core jQuery designed specifically for server-side HTML parsing and manipulation. It provides an API for traversing and modifying HTML/XML documents using familiar jQuery-like syntax.
2,076
704
$456K
jsoup
jsoup is a Java library for working with real-world HTML. It provides a very convenient API for fetching URLs and extracting and manipulating data, using the best of HTML5 DOM methods and CSS selectors. The library implements the WHATWG HTML5 specification and parses HTML to the same DOM as modern browsers do.
1,962
396
$1.3M
HTMLMinifier
JavaScript-based HTML compressor/minifier (with Node.js support)
1,033
377
$882K
sanitize-html
Clean up user-submitted HTML, preserving whitelisted elements and whitelisted attributes on a per-element basis. Built on htmlparser2 for speed and tolerance.
783
199
$102K
kramdown
kramdown is a fast, pure Ruby Markdown superset converter, using a strict syntax definition and supporting several common extensions.
752
329
$443K
fast-xml-parser
Validate, parse, and build XML rapidly, without C/C++-based libraries or callbacks.
735
184
$153M
AngleSharp
The ultimate angle brackets parser library, parsing HTML5, MathML, SVG and CSS to construct a DOM based on the official W3C specifications.
705
117
$6.3M
Html Agility Pack
Html Agility Pack (HAP) is a free and open-source HTML parser written in C# that reads and writes the DOM and supports plain XPath or XSLT. It is a .NET code library that allows you to parse "out of the web" HTML files.
554
73
$486K
htmlparser2
htmlparser2 is a fast and forgiving HTML/XML parser library for Node.js that can parse HTML according to the WHATWG HTML specification. It provides a streaming interface for efficiently parsing large chunks of data and supports custom handlers for processing parsed content.
477
175
$76K
Minify
Go minifiers for web formats
474
169
$23M
HTML React Parser
HTML to React parser.
405
95
$98K
parse5
parse5 is a fast and specification-compliant HTML parsing/serialization toolset for Node.js. It provides a full-featured HTML parser that generates a DOM tree from HTML code, following the WHATWG HTML specification.
361
171
$874K
Floki
Floki is a fast and flexible HTML/XML parser written in Elixir that enables easy traversal and manipulation of HTML/XML documents using CSS selectors, similar to jQuery.
323
98
$2.6M
Loofah
Ruby library for HTML/XML transformation and sanitization
302
126
$231K
xmlbuilder-js
An XML builder for node.js
283
89
$232K
xmldom
A JavaScript implementation of the W3C DOM specification that allows parsing and serializing XML documents in Node.js and browser environments
245
84
$2.4M
rehype
HTML processor powered by plugins, part of the @unifiedjs collective
179
69
$504K
REXML
REXML is a pure Ruby XML processor that provides a way to parse, validate, modify and generate XML documents in Ruby. It implements both DOM and SAX2 APIs for XML processing.
174
67
$1.4M
entities
A Node.js library for encoding and decoding HTML entities, providing functionality to convert special characters to their HTML entity representations and vice versa
111
52
$79K
hast-util-to-html
A JavaScript library that converts HAST (Hypertext Abstract Syntax Tree) nodes to HTML strings, providing a way to serialize HTML AST structures into their string representation
38
20
$133K
HTML DOM Parser
HTML to DOM parser.
Web Metadata Scraper
Get unified metadata from websites using Open Graph, Microdata, RDFa, Twitter Cards, JSON-LD, HTML, and more.
html-entities
Fastest HTML entities encode/decode library
libxml2
Read-only mirror of https://gitlab.gnome.org/GNOME/libxml2
lxml
The lxml XML toolkit for Python
pugixml
Light-weight, simple and fast XML parser for C++ with XPath support