LFX Insights

Unicode Text Processing Libraries

Libraries that provide specialized functionality for handling Unicode text segmentation, grapheme cluster recognition, and other complex text processing operations across different languages and writing systems.

11 projects

1,659 contributors

$6.3M software value

Emoji

A Python library for working with emoji characters, providing functionality to convert between emoji names, Unicode characters, and aliases. It includes utilities for emoji detection, conversion, and manipulation in text strings.

Contributors

326

Organizations

94

Software value

$3.4M

Slugify

A JavaScript library that converts strings into URL-friendly slugs by removing special characters, converting spaces to hyphens, and transliterating Unicode characters.

Contributors

264

Organizations

92

Software value

$42K

XRegExp

XRegExp is a JavaScript library that provides extended regular expression functionality, including support for named capture groups, Unicode properties, and modular pattern composition. It extends JavaScript's native RegExp with additional syntax and features while maintaining compatibility.

Contributors

262

Organizations

104

Software value

$857K

Change Case

A JavaScript/TypeScript library that provides string case conversion utilities to transform text between formats such as camelCase, PascalCase, and snake_case.

Contributors

253

Organizations

110

Software value

$42K
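
The case conversions described for Change Case can be illustrated with a minimal sketch. This is not the library's implementation or API; it is written in Python with only the standard library, the function names are hypothetical, and it assumes ASCII identifiers.

```python
import re

def split_words(text):
    """Split an identifier into lowercase words (ASCII-only assumption)."""
    # Mark lower-to-upper boundaries: "fooBar" -> "foo Bar".
    spaced = re.sub(r"([a-z0-9])([A-Z])", r"\1 \2", text)
    # Mark the end of an acronym: "HTTPResponse" -> "HTTP Response".
    spaced = re.sub(r"([A-Z]+)([A-Z][a-z])", r"\1 \2", spaced)
    # Treat any run of non-alphanumerics as a separator.
    return [w.lower() for w in re.split(r"[^A-Za-z0-9]+", spaced) if w]

def to_snake_case(text):
    return "_".join(split_words(text))

def to_pascal_case(text):
    return "".join(w.capitalize() for w in split_words(text))

def to_camel_case(text):
    words = split_words(text)
    if not words:
        return ""
    return words[0] + "".join(w.capitalize() for w in words[1:])

print(to_snake_case("parseHTTPResponse"))    # parse_http_response
print(to_camel_case("hello_world-example"))  # helloWorldExample
print(to_pascal_case("foo bar"))             # FooBar
```

Splitting into words first and joining afterwards keeps every target format a one-line function over the same word list.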

Python Slugify

A Python library that converts strings into URL-friendly slugs by removing special characters, converting spaces to hyphens, and handling Unicode characters. It provides customizable text normalization for clean URLs and filenames.

Contributors

153

Organizations

59

Software value

$40K
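
The slug conversion described above can be sketched with the Python standard library alone. This is an illustrative approximation, not the code of Python Slugify or Slugify: the function name and separator parameter are assumptions, and characters with no ASCII decomposition (for example CJK text) are simply dropped rather than transliterated.

```python
import re
import unicodedata

def slugify(text, separator="-"):
    """Return a lowercase, ASCII-only slug for use in URLs or filenames."""
    # Decompose accented letters, e.g. "é" -> "e" + combining acute accent.
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop everything outside ASCII, which removes the combining marks.
    ascii_text = decomposed.encode("ascii", "ignore").decode("ascii")
    # Replace each run of non-alphanumeric characters with one separator.
    slug = re.sub(r"[^a-z0-9]+", separator, ascii_text.lower())
    # Trim separators left at either end.
    return slug.strip(separator)

print(slugify("Héllo, Wörld!"))       # hello-world
print(slugify("  Crème brûlée  "))    # creme-brulee
```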

Textwrap

A Rust library that provides utilities for word wrapping and indenting text, with support for Unicode and configurable line breaks.

Contributors

129

Organizations

63

Software value

$161K
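
Word wrapping and indenting of the kind Textwrap provides can be tried out with Python's standard-library module of the same name. The sketch below shows the general technique only and says nothing about the Rust crate's API; note that the Python module measures width in code points, not terminal columns.

```python
import textwrap

text = "The quick brown fox jumps over the lazy dog"

# Break the text into lines of at most 15 characters, splitting at spaces.
wrapped = textwrap.fill(text, width=15)

# Prefix every line, as is common when quoting text in plain-text replies.
quoted = textwrap.indent(wrapped, "> ")

print(quoted)
# > The quick brown
# > fox jumps over
# > the lazy dog
```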

he

A JavaScript library for HTML entity encoding and decoding, providing robust support for converting special characters to their corresponding HTML entities and vice versa.

Contributors

103

Organizations

52

Software value

$725K
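
The round trip that he performs in JavaScript, from special characters to HTML entities and back, can be illustrated with Python's standard `html` module. This is a sketch of the concept, not of he's API; the sample strings are made up for illustration.

```python
import html

raw = 'Tom & Jerry <3 "cheese"'

# Encode the characters that are significant in HTML markup.
encoded = html.escape(raw, quote=True)
print(encoded)  # Tom &amp; Jerry &lt;3 &quot;cheese&quot;

# Decoding restores the original text.
assert html.unescape(encoded) == raw

# Named and numeric character references are decoded as well.
print(html.unescape("&copy; caf&#233;"))  # © café

# Non-ASCII characters can be written as numeric references when encoding.
numeric = "café".encode("ascii", "xmlcharrefreplace").decode("ascii")
print(numeric)  # caf&#233;
```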

unicase

A Rust library for case-insensitive string comparisons and conversions, providing utilities to work with ASCII and Unicode strings regardless of letter casing.

Contributors

85

Organizations

48

Software value

$90K
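
Case-insensitive comparison is harder than lowercasing both sides once text leaves plain ASCII. The following minimal sketch uses Python's built-in `str.casefold` to show the idea; the helper name is hypothetical and the code is unrelated to unicase's Rust API.

```python
def equals_ignore_case(a, b):
    """Compare two strings using Unicode case folding."""
    return a.casefold() == b.casefold()

# For ASCII text, lowercasing and case folding agree.
print(equals_ignore_case("Content-Type", "content-type"))  # True

# German sharp s folds to "ss", which plain lowercasing does not do.
print("Straße".lower() == "STRASSE".lower())    # False
print(equals_ignore_case("Straße", "STRASSE"))  # True
```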

uniseg

A Go library that provides Unicode text segmentation algorithms for breaking text into grapheme clusters, words, and sentences according to the rules of Unicode Standard Annex #29.

Contributors

59

Organizations

22

Software value

$892K

string-length

A JavaScript utility package that gets the real length of a string by correctly counting astral symbols and ignoring ANSI escape codes.

Contributors

25

Organizations

16

Software value

$3.7K
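
The two problems string-length addresses can be sketched in Python. Astral symbols occupy two UTF-16 code units, which is what JavaScript's `length` counts, and ANSI escape codes add characters that are never displayed. The regular expression below is an assumption that covers only common CSI sequences such as colors, and the function name is hypothetical.

```python
import re

# Matches CSI sequences such as "\x1b[31m" (set color) or "\x1b[2K" (erase line).
ANSI_CSI = re.compile(r"\x1b\[[0-9;?]*[ -/]*[@-~]")

def visible_length(text):
    """Count code points after removing ANSI CSI escape sequences."""
    return len(ANSI_CSI.sub("", text))

def utf16_length(text):
    """Count UTF-16 code units, as JavaScript's String.length does."""
    return len(text.encode("utf-16-le")) // 2

print(utf16_length("𝒳"))    # 2
print(visible_length("𝒳"))  # 1
print(visible_length("\x1b[31mred\x1b[0m"))  # 3
```

Counting code points still treats a base letter plus a combining mark as two characters; counting user-perceived characters requires grapheme cluster segmentation.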

is-unicode-supported

A Node.js package that detects whether Unicode is supported in the current terminal environment.

This project hasn't been onboarded to LFX Insights.