Diffstat (limited to 'README')
-rw-r--r-- README 66
1 files changed, 51 insertions, 15 deletions
diff --git a/README b/README
index 7ab303b..114dfb3 100644
--- a/README
+++ b/README
@@ -1,34 +1,70 @@
-libextractor
-============
+About
+=====
+
+GNU libextractor is a simple C library for keyword extraction.
+Common use-cases for GNU libextractor include detail-views in
+file managers, detailed search results in file-sharing networks
+and general information gathering in forensics investigations
+and penetration testing.
 
-libextractor is a simple library for keyword extraction. libextractor
-does not support all formats but supports a simple plugging mechanism
-such that you can quickly add extractors for additional formats, even
-without recompiling libextractor. libextractor typically ships with a
-few dozen helper-libraries (plugins) that can be used to obtain
-keywords from common file-types.
+Bindings for GNU libextractor exists for many languages in addition to
+the standard C/C++ API (we know about bindings for Java, Perl, PHP,
+Mono, Python, Ruby).
 
-libextractor is a GNU package (http://www.gnu.org/).
+libextractor uses a plugin mechanism to enable developers to quickly
+add extractors for additional formats. Plugins are executed
+out-of-process and can thus bugs in them (or the libraries that they
+use) cannot crash the main application. libextractor typically ships
+with a few dozen plugins that can be used to obtain keywords from
+common file types.
 
-More detailed documentation is available in the GNU libextractor manual.
+More detailed documentation is available in the GNU libextractor
+manual. libextractor is an official GNU package and available from
+http://www.gnu.org/s/libextractor/.
 
 
 extract
 =======
 
-extract is a simple command-line interface to libextractor.
-
+extract is a simple command-line interface to GNU libextractor.
 
 
 Dependencies
 ============
 
-* zlib (compression library)
 * GNU C/C++ compiler
 * libltdl 2.2.x (from GNU libtool)
 * GNU libtool 2.2 or higher
 * GNU gettext
 
+The following dependencies are all optional, but should be
+available in order for maximum coverage:
+
+* libavutil / libavformat / libavcodec / libswscale (ffmpeg)
+* libbz2 (bzip2)
+* libexiv2
+* libflac
+* libgif (giflib)
+* libglib (glib)
+* libgtk+
+* libgsf
+* libgstreamer
+* libjpeg
+* libmagic (file)
+* libmpeg2
+* librpm
+* libtidy
+* libtiff
+* libvorbis / libogg
+* libz (zlib)
+
 When building libextractor binaries, please make sure all of these
-dependencies are available. Otherwise the build system may
-automatically build only a subset of libextractor.
+dependencies are available and configure detects a sufficiently recent
+installation. Otherwise the build system may automatically build only
+a subset of GNU libextractor resulting in mediocre meta data
+production.
+
+Finally, 'zzuf' is a fuzzing tool that can optionally be detected by
+the build system and be used for debugging / testing. It is not required
+at runtime or for normal builds.
+