1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
244
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
290
291
292
293
294
295
296
297
298
299
300
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
318
319
320
321
322
323
324
325
326
327
328
329
330
331
332
333
334
335
336
337
338
339
340
341
342
343
344
345
346
347
348
349
350
351
352
353
354
355
356
357
358
359
360
361
362
363
364
365
366
367
368
369
370
371
372
373
374
375
376
377
378
379
380
381
382
383
384
385
386
387
388
389
390
391
392
393
394
395
396
397
398
399
400
401
402
403
404
405
406
407
408
409
410
411
412
413
414
415
416
417
418
419
420
421
422
423
424
425
426
427
428
429
430
431
432
433
434
435
436
437
438
439
440
441
442
443
444
445
446
447
448
449
450
451
452
453
454
455
456
457
458
459
460
461
462
463
464
465
466
467
468
469
470
471
472
473
474
475
476
477
478
479
480
481
482
483
484
485
486
487
488
489
490
491
492
493
494
495
496
497
498
499
500
501
502
503
504
505
506
507
508
509
510
511
512
513
514
515
516
517
518
519
520
521
522
523
524
525
526
527
528
529
530
531
532
533
534
535
536
537
538
539
540
541
542
543
544
545
546
547
548
549
550
551
552
553
554
555
556
557
558
559
560
561
562
563
564
565
566
567
568
569
570
571
572
573
574
575
576
577
578
579
580
581
582
583
584
585
586
587
588
589
590
591
592
593
594
595
596
597
598
599
600
601
602
603
604
605
606
607
608
609
610
611
612
613
614
615
616
617
618
619
620
621
622
623
624
625
626
627
628
629
630
631
632
633
634
635
636
637
638
639
640
641
642
643
644
645
646
647
648
649
650
651
652
653
654
655
656
657
658
659
660
661
662
663
664
665
666
667
668
669
670
671
672
673
674
675
676
677
678
679
680
681
682
683
684
685
686
687
688
689
690
691
692
693
694
695
696
697
698
699
700
701
702
703
704
705
706
707
708
709
710
711
712
713
714
715
716
717
718
719
720
721
722
723
724
725
726
727
728
729
730
731
732
733
734
735
736
737
738
739
740
741
742
743
744
745
746
747
748
749
750
751
752
753
754
755
756
757
758
759
760
761
762
763
764
765
766
767
768
769
770
771
772
773
774
775
776
777
|
Mon Nov 28 17:16:16 CET 2011
Reduce false-positives in MP3 extractor file format detection. -LRN
Mon Nov 28 17:15:59 CET 2011
Improved winsock2 detection. -LRN
Mon Nov 28 12:17:42 CET 2011
Fixing compiler warnings, cleaning up ASF plugin. Finishing
Matroska plugin. -CG
Releasing libextractor 0.6.3.
Fri Jul 22 21:46:32 CEST 2011
Added Ukrainian translation. -CG
Sat Aug 14 23:01:59 CEST 2010
Various minor bugfixes (largely resource leaks on error paths). -CG
Sun Jun 13 13:15:43 CEST 2010
Releasing libextractor 0.6.2.
Sat Jun 12 22:32:32 CEST 2010
Fixing various bugs, including some that can cause crashes given
malformed inputs. -CG
Sat Jun 12 16:23:14 CEST 2010
Only pass 'unsigned char's to 'isspace' and similar functions. -CG
Sun Mar 14 00:09:45 CET 2010
Releasing libextractor 0.6.1.
Wed Jan 13 17:11:07 CET 2010
Releasing libextractor 0.6.0.
Wed Jan 13 14:36:24 CET 2010
Adding support for extracting data from the end of files. -CG
Sun Dec 13 16:53:35 CET 2009
Starting with major API breakage with the goal to fix all of
the not-so-nice things that have accumulated since version 0.0.0.
Added support for out-of-process execution from plugins. -CG
Sat Dec 5 11:32:30 CET 2009
Adding extraction of Iptc data using exiv2.
Sat Jul 4 23:05:22 CEST 2009
Fixed code to work with RPM 4.7.
Releasing libextractor 0.5.23.
Sat Apr 11 20:46:14 MDT 2009
Removed code from libexiv2, linking against it instead.
Fri Feb 20 00:28:00 MST 2009
Releasing libextractor 0.5.22.
Sun Feb 15 16:57:46 MST 2009
Upgraded to libtool 2.x (and the libltdl from that release).
Sun Nov 2 20:19:02 MST 2008
Releasing libextractor 0.5.21.
Tue Aug 12 04:40:49 EEST 2008
Added an S3M (Scream Tracker 3 Module) plugin.
Tue Aug 12 03:55:01 EEST 2008
Added an XM (eXtended Module) plugin.
Mon Aug 11 00:43:46 EEST 2008
Added an IT (Impulse Tracker) plugin.
Tue Jul 22 02:51:33 MDT 2008
Changed RPM extractor to use librpm.
Fixed crash in OpenOffice extractor.
Fixed crash in tiff extractor.
Sun Jul 13 19:31:35 MDT 2008
Fixed endianess issues in mp3 extractor.
Fixed build issues (need to link C++ code explicitly against
libstdc++ on BSD).
Releasing libextractor 0.5.20c.
Mon Jun 23 19:05:07 EET 2008
Fixed concurrency issues in plugin (un-)loading by adding locking
around libltdl functions.
Fri Jun 20 23:34:02 EET 2008
Added an FFmpeg-based thumbnail extractor plugin, initially
supporting only bmp and png files.
Mon Apr 28 08:40:43 MDT 2008
Updated Dutch translation.
Fri Apr 25 08:29:29 MDT 2008
Fixed security issues in XPDF-based PDF extractor.
Releasing libextractor 0.5.20b.
Mon Mar 21 00:00:52 MDT 2008
Releasing libextractor 0.5.20a.
Tue Apr 1 10:06:03 MDT 2008
Updated Swedish translation.
Sun Mar 30 08:31:11 MDT 2008
Updated Vietnamese translation.
Sun Mar 23 14:40:58 MDT 2008
Updated German translation.
Sat Mar 22 19:29:49 MDT 2008
Updated Gaelic translation.
Fri Mar 21 13:26:33 MDT 2008
Added Dutch translation.
Thu Mar 20 23:38:47 MDT 2008
Releasing libextractor 0.5.20.
Fri Mar 7 13:29:01 EET 2008
Added disc number.
Thu Mar 6 23:11:39 MST 2008
Added track number and ISRC for FLAC/mp3/ogg files.
Wed Feb 13 10:16:55 EET 2008
Added a plugin for AppleSingle/AppleDouble files.
Mon Feb 11 22:58:48 MST 2008
Various minor code cleanups.
Sat Jan 12 14:14:32 MST 2008
Fixed security issues in XPDF-based PDF extractor.
Releasing libextractor 0.5.19a.
Mon Jan 7 08:46:40 MST 2008
Releasing libextractor 0.5.19.
Wed Dec 26 19:38:22 MST 2007
Added a FLAC (.flac) plugin.
Wed Dec 26 14:50:11 EET 2007
Added a Flash Video (.flv) plugin.
Mon Dec 24 18:26:56 MST 2007
Add support for some common iTunes tags to qtextractor.
Mon Dec 10 17:27:28 MST 2007
Disable libgsf logging (for corrupt files).
Sun Jul 29 02:30:40 MDT 2007
Added escape (\n) handling to split extractor.
Wed Jul 4 17:36:53 MDT 2007
Fixed problem with newer versions of libgsf.
Fixed problem with automake 1.10 not setting MKDIR_P.
Releasing libextractor 0.5.18a.
Sat Jun 9 01:34:21 MDT 2007
Working on Qt build process. Created TexInfo manual.
Sun Mar 11 17:58:14 MDT 2007
Releasing libextractor 0.5.18.
Fri Feb 23 18:43:33 MST 2007
Fixing symbols for thumbnail extractors.
Thu Feb 8 13:01:34 MST 2007
Upgrade to gettext-0.16.1.
Sun Feb 4 23:51:08 MST 2007
Better handling of build process without C++ compiler.
Sun Jan 28 20:54:35 MST 2007
Biased removal of duplicate keywords against those obtained
from splitting. Do not allow splitextractor to produce mere
copy of original keyword. Fixed two minor bugs.
Sat Jan 6 14:27:18 EET 2007
Added an NSFE (Extended Nintendo Sound Format) plugin.
Tue Jan 2 19:38:10 MST 2007
Fixed various build issues. Releasing libextractor 0.5.17a.
Mon Jan 1 19:10:40 MST 2007
Added pkgconfig support. Releasing libextractor 0.5.17.
Thu Dec 28 20:22:20 MST 2006
Fixed bug in splitextractor, addressing also Mantis #1125.
Thu Dec 28 18:12:15 MST 2006
Added -g (greppable output, Mantis #1157) option to extact.
Mon Nov 20 22:08:55 EET 2006
Added an SID (C64 music file) plugin.
Sat Nov 11 16:04:38 MST 2006
Fixed libltdl side-effect of loading libextractor; code
now preserves the old library search path and only appends
its own. Linking main libextractor library against
libgsf (as workaround for GSF bug). Releasing libextractor 0.5.16.
Sat Nov 11 00:04:34 EET 2006
Added an NSF (NES Sound Format) plugin.
Sat Sep 16 12:36:42 MDT 2006
Added support for various additional tags to ID3v2 extractors.
Now (again) trimming whitespace at the end of ID3v1 tags.
Wed Sep 6 13:38:55 PDT 2006
Added tIME support to PNG extractor. Bugfixes in PDF extractors.
Made libextractor relocateable (plugin path no longer hardwired
into binary, using various tricks instead to find path).
Translation updates. Releasing libextractor 0.5.15.
Wed May 17 02:05:37 PDT 2006
Switched mpegextractor to use libmpeg2 (improves correctness,
adds dependency!). Releasing libextractor 0.5.14.
Tue May 16 20:08:30 PDT 2006
Dramatically improved qt extractor (essentially re-written from
scratch).
Fri Apr 28 22:26:43 PDT 2006
Integrated wordleaker into OLE2 plugin. Changed OLE2 plugin to use
libgsf (new dependency!). Releasing libextractor 0.5.13.
Fri Apr 28 16:18:26 PDT 2006
Fixing some i18n issues. Specifically, EXTRACTOR_getKeywordTypeAsString
will now never return the translated version of the keyword type
(before, it *sometimes* returned the translated version, depending on
the default gettext domain and translation availability). If translation
is desired, clients should use 'dgettext("libextractor", ret-value)' to
translate the returned value.
Wed Apr 26 12:20:00 PDT 2006
Some improvements for OpenBSD portability.
Wed Apr 26 10:28:11 PDT 2006
Added Vietnamese and Swedish translations.
Sat Apr 22 11:18:56 PDT 2006
Final touches to new build of printable extractors.
Releasing libextractor 0.5.12.
Tue Apr 18 14:44:37 PDT 2006
Improved memory utilization for printable extractors
at compile time. Added dictionaries for Finnish,
French, Gaelic and Swedish (for printable extractors).
Fri Mar 24 21:43:43 PST 2006
Started re-implementation of PDF support from scratch
(incomplete but working). Improvements to the build
system.
Thu Mar 9 17:46:39 PST 2006
Added support for wordleaker (additional meta-data for
OLE2 streams). Releasing libextractor 0.5.11.
Sat Feb 18 17:39:10 PST 2006
Yet another round of XPDF-related security fixes.
Releasing libextractor 0.5.10.
Tue Jan 31 12:51:55 PST 2006
Mis-detection of man pages as part of TAR archives fixed.
Wed Jan 11 11:33:46 PST 2006
More Mime-types for the OLE2 extractor. Also ignore (harmless)
libc errors in plugins when extracting.
Thu Jan 5 16:51:36 PST 2006
More TAR improvements: keywords 'date' and 'format' are
extracted. More checksums variants were added. Long filenames
as produced by GNU and Schilling tar (possibly Solaris pax also)
are extracted.
Fri Dec 23 11:28:23 PST 2005
Releasing libextractor 0.5.9.
Sun Dec 11 23:52:50 PST 2005
Made TAR extractor parsing more robust.
Fri Dec 9 23:17:21 PST 2005
Fixing crash in MIME-extractor due to typo in the code.
Tue Dec 6 13:25:56 PST 2005
Fixed security problems in PDF extractor
(http://www.idefense.com/application/poi/display?id=344&type=vulnerabilities)
Releasing libextractor 0.5.8.
Sun Dec 4 23:36:00 PST 2005
Fixed AVI mime-type to be video/x-msvideo.
Sat Nov 12 10:50:46 PST 2005
Releasing libextractor 0.5.7.
Wed Nov 9 12:51:52 PST 2005
Fix in LE unload code (potential double-free, maybe BSD-specific).
Tue Sep 27 11:01:57 PDT 2005
Again better Mime-type detection for OLE2 streams.
Mon Sep 26 20:44:10 PDT 2005
Minor improvements to the PDF extractor:
- first change is to avoid outputting keywords with empty values
(for now the new check is only effective when the value is seen
as a string of 8-bit characters; I'm not sure how to rewrite it
for the Unicode case in the if branch just above.)
- second change is to remap PDF Creator as 'software' keyword
instead of 'creator'.
Sun Sep 25 11:31:51 PDT 2005
Made sure extract returns error code (1) if some files could not
be accessed.
Thu Sep 22 21:05:53 PDT 2005
Improved TAR extractor:
- it now accepts old-style (UNIX V7) archives
- it produces a mimetype for old-style archives
- it outputs the file names in the same order as in the TAR file
- its end-of-file mark detection is more robust
Updated German translation.
Wed Sep 21 13:54:19 PDT 2005
Added Irish translation.
Wed Sep 21 00:01:01 PDT 2005
Fixed gettext build problem.
Removed warning that the OLE2 extractor was printing.
Sun Sep 18 19:34:48 PDT 2005
Major rewrite of the HTML extractor. Should extract more, is
simpler and probably more robust.
Releasing libextractor 0.5.6.
Fri Sep 16 16:41:04 PDT 2005
Made LE malloc file READ-ONLY. This should help the VM conserve
memory, however this breaks the HTML extractor.
Thu Sep 15 21:55:19 PDT 2005
Fixing compiler warnings given by gcc 4.0.
Thu Sep 15 00:56:51 PDT 2005
Fixed incorrectly handled integer overflow in png extractor.
Wed Sep 14 15:02:49 PDT 2005
Avoid malloc/memcpy of file in exiv2 extractor (optimization, avoids
problems with very large files where malloc may run into problems).
Wed Sep 14 13:50:15 PDT 2005
Changed code for backwards-compatibility with zlib 1.1
(thanks to Ronan Melennec).
Tue Sep 13 04:49:43 PDT 2005
Fixed segmentation fault in bz2 processing.
Fri Sep 9 14:57:10 PDT 2005
Fixed bug in decompression code that occured if the compressed file
expanded to more than twice its original size.
Wed Sep 7 21:41:35 PDT 2005
Added decompression of gz and bz2 streams to the LE core library
(avoids need to do this, possibly repeatedly, in plugins and makes
sure that all plugins work with compressed files). Eliminated gz
decompression from man and tar extractors.
Releasing libextractor 0.5.5.
Sun Sep 4 02:08:56 PDT 2005
Changed code to export fewer symbols (refactoring plus linker options,
goal is to address Mantis #925. Changed debian extractor to no longer
require threads. Dead code elimination in OO and OLE2 extractors.
Minor bugfixes ported from libgsf 1.12.2 to OLE2 extractor.
Fri Sep 2 03:17:10 PDT 2005
Added support for Mime-types for Microsoft Office formats.
Fri Aug 26 22:32:06 PDT 2005
Added workaround libstdc++ bug #23591 (Mantis bug #907).
Releasing libextractor 0.5.4.
Tue Aug 23 15:39:37 PDT 2005
Fixed build on OS X.
Tue Aug 23 12:35:35 PDT 2005
Fixed character set conversion in OLE2 extractor
(big thanks to Jody Goldberg).
Sat Aug 20 21:27:17 PDT 2005
Fixed memory leak in thumbnail extractor.
Thu Aug 18 21:18:28 PDT 2005
Made quotations match GNU standards.
Sat Aug 13 18:41:02 PDT 2005
Fixed problems with ole2 extractor. Also removed
requirement for static version of glib (!).
Releasing libextractor 0.5.3.
Fri Aug 12 23:53:54 PDT 2005
Fixed bug in OO extractor that made it not work.
Fixed bug in exiv2 extractor that killed keywords
found by other extractors.
Improved OO extractor mime-type detection.
Mon Aug 8 12:18:44 PDT 2005
Somehow addKeyword2 got lost. Added (again?).
Fixed compilation problems with gcc-2.95.
Thu Jul 14 18:52:17 CEST 2005
Bugfixes in exiv2 extractor fixing remaining issues.
Changed plugins to not use filename but always only
rely on mmapped memory. Extended API with function
that allows running getKeywords on data in memory
(instead of filename). Extended API with encode
and decode functions for binary metadata.
Releasing libextractor 0.5.2.
Mon Jul 4 18:10:14 CEST 2005
Preliminary integration of exiv2 support (not enabled
by default due to bugs). Moved Python and Java
bindings into separate packages.
Releasing libextractor 0.5.1.
Wed Jun 29 15:37:51 CEST 2005
Finally found out how to disable building static libs.
This should cut down compile time and installed size
by about a factor of 2 -- especially good since the
static version of the plugins is pretty, well, useless.
Sat Jun 18 14:56:38 EST 2005
Fixed a score of compiler warnings and some minor bugs,
none of which should have been observable.
Sat May 21 13:42:46 EST 2005
Releasing libextractor 0.5.0.
Fri May 6 14:54:58 EST 2005
Added flag to disable building of printable extractors
(important for systems with not that much memory).
Fri May 6 06:02:02 EST 2005
Added Python binding.
Tue Apr 5 17:22:28 EST 2005
Added translation to Kinyarwanda.
Thu Feb 24 00:32:44 EST 2005
Added extractor that extracts binary (!) thumbnails from
images using ImageMagick. Decoder function for the binary
string is in the thumbnailextractor.c source.
Releasing libextractor 0.4.2.
Wed Feb 23 22:42:08 EST 2005
Comment tag was not extracted from ID3 tags. Fixed.
Sun Feb 20 16:36:17 EST 2005
Fixed similar problem in REAL extractor. Added support
for new Helix/Real format to REAL extractor.
Sun Feb 20 12:48:15 EST 2005
Fixed (rare) integer overflow bug in PNG extractor.
Sat Feb 19 22:58:30 EST 2005
Fixed problems with wrong byteorder for Unicode decoding
in PDF meta-data. Fixed minor problems with character
set conversion error handling.
Wed Jan 26 19:31:04 EST 2005
Workaround possible bug in glib quarks (OLE2 extractor).
Improved QT support (?nam tag, support for description).
Releasing libextractor 0.4.1.
Fri Jan 21 15:23:43 PST 2005
Adding support for creation date for tar files.
Fixed security problem in PDF extractor.
Sun Jan 2 21:12:52 EST 2005
Fixing some linking problems.
Fri Dec 31 20:26:43 EST 2004
Excluding executables from printable extractors.
Sat Dec 25 19:24:54 CET 2004
PDF fixes. Fixing mantis bug (PDF charset conversion
for UTF-8 console). Releasing libextractor 0.4.0.
Fri Dec 24 15:43:35 CET 2004
Adding support calling LE for python (draft, not
tested, possibly not working yet).
Fri Dec 24 13:28:59 CET 2004
Added support for Unicode to the pdf extractor.
Fri Dec 24 09:14:08 CET 2004
Improving mp3 (Id3v1): adding genres, minor
bugfixes.
Fri Dec 24 07:23:03 CET 2004
Improving PNG: converting to utf-8 and handling
compressed comments.
Thu Dec 23 18:14:10 CET 2004
Avoided exporting symbol OPEN (conflicts on OSX
with same symbol from GNUnet). Added conversion
to utf8 to various plugqins (see todo) and
added conversion from utf8 to current locale to
print keywords.
Sat Nov 13 13:23:23 EST 2004
Releasing libextractor 0.3.11.
Fri Nov 12 19:20:37 EST 2004
Fixed bug in PDF extractor (extremely rare segfault).
Fixed #787.
Fixed bug in man extractor (undocumented return value running on
4 GB file not taken care of properly).
Sat Oct 30 20:18:21 EST 2004
Fixing various problems on Sparc64 (bus errors).
Workaround for re-load glib problem of OLE2 extractor.
Sat Oct 23 13:21:23 EST 2004
Releasing libextractor 0.3.10.
Fri Oct 22 22:22:28 EST 2004
Fixing memory leak after extensive valgrinding.
Fri Oct 22 19:18:38 EST 2004
id3v2.3 and id3v2.4 work. Some bugfixes.
Sun Oct 17 18:12:11 EST 2004
tar and tar.gz work. Releasing libextractor 0.3.9.
Sun Oct 17 17:42:16 EST 2004
deb works.
Sun Oct 17 13:52:25 EST 2004
man works.
Tue Oct 5 14:29:31 EST 2004
Updated xpdf extractor (to fix Mantis #754). Fixed bug in Id3v2
extractor (potential segfault). Added support for extracting
image size from jpeg. General code cleanup. 64-bit file
support.
Mon Oct 4 20:28:52 EST 2004
Fixed jpeg extractor to not hang on certain malformed JPEG files.
Sat Oct 2 18:02:56 EST 2004
Added support for dvi. Removed special code for OS X,
normal libtool works fine now (and suddenly LE works for OS X).
Releasing libextractor 0.3.8.
Sun Sep 26 19:25:10 EST 2004
Moved libextactor plugins to separate directory, building
plugins as plugins and not as libraries.
Thu Sep 23 11:25:42 EST 2004
Added support for ID3v2. Added support for StarOffice (OLE2).
Fixed some minor build issues. Releasing libextractor 0.3.7.
Tue Sep 14 21:25:22 EST 2004
Improved performance of the HTML extractor by avoiding parsing
after the header (factor of 25 improvement for a 4 MB HTML file
resulting in a total improvement for total extraction time for
running all extractors of about 50%). Improved performance
of the ZIP extractor for non-zip files by testing for the ZIP
header before trying to locate the central directory (for 5 MB
/dev/random time improves by a factor of about 15). Same change
was also applied to the OO extractor (since OO is effectively a
zip). Overall improvement for 5 MB /dev/random for running
all extractors is a factor of 10 (now takes 100ms on my machine
to run 720 times on the same 5 MB file passing that file as an
argument; the remaining time is pretty much doing 720x mmap and
related system calls).
Fri Sep 10 22:00:09 EST 2004
Added support for RipeMD-160.
Fri Sep 10 19:49:39 EST 2004
Added support for SHA-1 and MD5. Releasing libextractor 0.3.6.
Fri Sep 10 10:35:27 EST 2004
Added support for OpenOffice documents (meta.xml in
zip-file).
Mon Aug 30 23:16:17 IST 2004
Added support for OLE2 (WinWord, Excel, PowerPoint).
Fixed various bugs (Segfault in elf, leaks in zip and RPM,
out-of-bounds access in QT). Releasing libextractor 0.3.5.
Wed Aug 25 18:42:11 IST 2004
Added support for GNU gettext. Releasing libextractor 0.3.4.
Fri Jul 2 20:10:54 IST 2004
Using mime-types to selectively disable parsing extractors
to increase performance.
Wed Jun 23 13:37:02 IST 2004
Added support for wav. Fixed problems in mpeg and riff
extractors. Releasing libextractor 0.3.3.
Sun Jun 6 18:42:28 IST 2004
Fixed segfault in qtextractor.
Mon May 31 18:19:07 EST 2004
Fixed more minor bugs. Releasing libextractor 0.3.2.
Mon May 31 17:14:55 EST 2004
Removed comment extraction from RIFF extractor (format
detection is not good enough to avoid garbage for non-RIFF
files). Also fixed rare seg-fault in PDF-extractor (xpdf
author notified).
Mon May 24 13:40:27 EST 2004
Changed build system to avoid having an extra library
(libextractor_util is gone).
Wed Apr 28 19:28:39 EST 2004
Releasing libextractor 0.3.1.
Wed Apr 28 01:26:53 EST 2004
Added ELF extractor.
Sat Apr 24 00:07:31 EST 2004
Fixed memory leak in PDF-extractor.
Mon Apr 12 01:30:20 EST 2004
Added Java binding. If jni.h is present (and working!),
libextractor is build with a couple of tiny additional
methods that are sufficient to build a Java class to
access libextractor. The API is still incomplete but
already basically functional. Releasing 0.3.0
Sat Apr 10 01:34:04 EST 2004
Added RIFF/AVI extractor based on AVInfo.
Fixed memory-leak and potential segfault in zipextractor.
Sat Apr 10 00:30:19 EST 2004
Added MPEG (video) extractor based on AVInfo. Improved
output of mp3 extractor.
Fri Apr 9 22:58:51 EST 2004
Improved library initialization (and destruction) code.
Thu Apr 8 22:25:19 EST 2004
Revisited type signatures adding const where applicable.
Improved formatting of --help for extract. Added some
testcases. Updated man-pages.
Wed Apr 7 00:26:29 EST 2004
Made HTML and ZIP extractors re-entrant.
Fixed minor problems in ZIP extractor (possible segfault,
possible memory leaks; both for invalid ZIP files).
Sun Apr 4 20:24:39 EST 2004
Added TIFF extractor. Fixed segfault in removeLibrary.
Port to mingw. Releasing 0.2.7.
Tue Oct 14 17:43:09 EST 2003
Fixed segfault in PDF and RPM extractors.
Fixed BSD compile errors. Port to OSX.
Releasing 0.2.6.
Sun Oct 12 18:05:37 EST 2003
Ported to OSX, fixing endianess issues with printable
extractors.
Tue Jul 22 11:38:42 CET 2003
Fixed segfault with option -b for no keywords found.
Wed Jul 16 13:41:34 EST 2003
Releasing 0.2.5.
Mon Jun 30 21:27:42 EST 2003
Releasing 0.2.4.
Sun Jun 15 18:05:24 EST 2003
Added support for pspell to printableextractor.
Sat Apr 19 04:11:14 EST 2003
Fixed missing delete operation in PDF extractor for
non-PDF files (caused memory leak and file-handle leak).
Thu Apr 10 23:54:17 EST 2003
Fixed segmentation violation in png extractor.
Thu Apr 10 01:34:49 EST 2003
Rewrote RPM extractor to make it no longer depend on rpmlib.
Fri Apr 4 21:39:55 EST 2003
Added QT extractor, but again not really tested due to lack of
QuickTime file with meta-data in it.
Thu Apr 3 23:09:44 EST 2003
Added ASF extractor, but not really tested due to lack of
ASF file with meta-data in it.
Thu Apr 3 04:04:19 EST 2003
Fixing ogg-extractor to work with new version of libvorbis that
requires us to link against libvorbisfile.
Wed Apr 2 22:22:16 EST 2003
Cleaned up plugin mechanism (ltdl).
Wed Apr 2 12:09:27 EST 2003
zipextractor now works with self-extracting zip executables.
Sat Feb 01 05:35:24 EST 2003
Changed loading of dynamic libraries to the more portable libltdl.
Thu Jan 23 00:34:20 EST 2003
Wrote RPM extractor.
Tue Jan 21 03:11:02 EST 2003
Fixed minor bug in ps extractor (now stops parsing at %%EndComments).
Thu Jan 9 18:41:01 EST 2003
License changed to GPL (required for pdf extractor), releasing 0.1.4.
Tue Jan 7 18:31:38 EST 2003
Added postscript (ps) extractor.
Tue Dec 31 15:26:00 EST 2002
Added pdf extractor based on xpdf code.
Tue Dec 17 20:36:13 CET 2002
Added MIME-extractor.
Fri Nov 22 21:54:10 EST 2002
Fixed portability problems with the gifextractor, in particular
the code now ensures that C compilers that do not pack the structs
are still going to result in working code.
Tue Oct 1 14:01:16 EST 2002
Fixed segmentation fault in ogg extractor.
Fri Jul 26 16:25:38 EST 2002
Added EXTRACTOR_ to every symbol in the extractor API to
avoid name-clashes.
Wed Jun 12 23:42:55 EST 2002
Added a dozen options to extract.
Fri Jun 7 01:48:34 EST 2002
Added support for real (real.com).
Fri Jun 7 00:21:40 EST 2002
Added support for GIF (what a crazy format).
Tue Jun 4 23:21:38 EST 2002
Added support for PNG, no longer reading the
file again and again for each extractor (slight
interface change, mmapping).
Sun Jun 2 22:49:17 EST 2002
Added support for JPEG and HTML. HTML does not
support concurrent use, though (inherent problem
with libhtmlparse). Released v0.0.2.
Sat May 25 16:56:59 EST 2002
Added building of a description from artist,
title and album, fixed bugs.
Tue May 21 22:24:07 EST 2002
Added removing of duplicates, splitting keywords,
extraction of keywords from filenames.
Sat May 18 16:33:28 EST 2002
more convenience methods ('configuration', default
set of libraries, remove all libraries)
Sat May 18 02:33:28 EST 2002
ogg extractor works, mp3 extractor now always works
Thu May 16 00:04:03 EST 2002
MP3 extractor mostly works.
Wed May 15 23:38:31 EST 2002
The basics are there, let's write extractors!
|