commit | d127078cedef9c6642cbe592dacdd2292b50bb19 | [log] [tgz] |
---|---|---|
author | Jinsuk Kim <jindor.code@gmail.com> | Mon Feb 12 21:23:14 2024 |
committer | GitHub <noreply@github.com> | Mon Feb 12 21:23:14 2024 |
tree | 235ccfd44deaee7f998a98133cfa9da58615144d | |
parent | 9f8e8f50dfa6be0243511b42ff443498458c029a [diff] | |
parent | dcab2ba7cbe0ebab593a1f2f378155ae30024561 [diff] |
Merge pull request #28 from cloudaper/add-3rd-party-to-readme Add 3rd party bindings to README
Compact Encoding Detection(CED for short) is a library written in C++ that scans given raw bytes and detect the most likely text encoding.
Basic usage:
#include "compact_enc_det/compact_enc_det.h" const char* text = "Input text"; bool is_reliable; int bytes_consumed; Encoding encoding = CompactEncDet::DetectEncoding( text, strlen(text), nullptr, nullptr, nullptr, UNKNOWN_ENCODING, UNKNOWN_LANGUAGE, CompactEncDet::WEB_CORPUS, false, &bytes_consumed, &is_reliable);
You need CMake to build the package. After unzipping the source code , run autogen.sh
to build everything automatically. The script also downloads Google Test framework needed to build the unittest.
$ cd compact_enc_det $ ./autogen.sh ... $ bin/ced_unittest
On Windows, run cmake .
to download the test framework, and generate project files for Visual Studio.
D:\packages\compact_enc_det> cmake .
Have you created bindings for another language? Open a PR and add it to the list!