Monday, February 14, 2011

Sunday, February 13, 2011

PDF (Tables) to Text more

pdftotext (1) - Portable Document Format (PDF) to text converter (version 3.00)

pdftk - useful tool for manipulating PDF documents

PDFBox

There are a number of commercial PDF-to-Excel converters available ... one (Nitro) has an online demo but limits input file size to 10MB :(
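For tables, pdftotext's -layout option is probably the first thing to try, since it keeps the physical layout of the page so columns stay lined up. A minimal sketch, assuming pdftotext is on the PATH; the function name pdf_table_pages and the PDFTOTEXT override are my own inventions for illustration, not part of the tool:

```shell
#!/usr/bin/env bash
# Sketch: convert a page range of a PDF to text, keeping the column layout.
# PDFTOTEXT is a hypothetical override so the wrapper can be dry-run.
PDFTOTEXT="${PDFTOTEXT:-pdftotext}"

# pdf_table_pages PDF FIRST_PAGE LAST_PAGE OUTPUT_TXT
pdf_table_pages() {
    local pdf="$1" first="$2" last="$3" out="$4"
    # -layout keeps the physical layout so table columns stay aligned;
    # -f / -l select the first and last page to convert.
    "$PDFTOTEXT" -layout -f "$first" -l "$last" "$pdf" "$out"
}
```

Usage would be something like `pdf_table_pages manual.pdf 100 110 /tmp/regs.txt` (file name and page numbers made up here). Restricting the page range keeps the output small enough to eyeball whether the table survived.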

devmem2.c

http://www.lartmaker.nl/lartware/port/devmem2.c

devmem2.c: access device registers from Linux userspace

Builds on the target machine (or can be cross-compiled).

Run it without params for usage information.

Addresses are physical.

Prefix with 0x for hex addresses.

It can be wrapped in a script to read out multiple register values in one go.
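A rough sketch of such a wrapper, assuming devmem2 is on the PATH and is called as `devmem2 address w` for a word-sized read. The function name read_regs, the NAME=ADDR argument convention and the DEVMEM2 override are hypothetical, made up for this example:

```shell
#!/usr/bin/env bash
# Sketch: read several registers via devmem2 and label each result.
# DEVMEM2 is a hypothetical override so the loop can be tried without hardware.
DEVMEM2="${DEVMEM2:-devmem2}"

# read_regs NAME=ADDR [NAME=ADDR ...]
# Runs a word-sized read on each physical address and prints a label first.
read_regs() {
    local pair name addr
    for pair in "$@"; do
        name="${pair%%=*}"   # text before the first '='
        addr="${pair#*=}"    # text after the first '='
        printf '%s (%s):\n' "$name" "$addr"
        "$DEVMEM2" "$addr" w
    done
}
```

Call it like `read_regs REG_A=0x1000 REG_B=0x1004` (names and addresses invented here; take the real ones from the reference manual). It needs enough privilege to open /dev/mem, and reading some device registers has side effects, so pick addresses with care.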

Extracting text from PDF file with PDFBox ExtractText command

http://pdfbox.apache.org/commandlineutilities/ExtractText.html

Needs some installation (assuming PDFBox isn't already installed, e.g. by your Linux distribution):

Check out the pdfbox trunk from SVN

apt-get install maven2

mvn clean install

Once it has built successfully, use it like:

usage: java -jar pdfbox-app-x.y.z.jar ExtractText [OPTIONS] <PDF file> [Text file]

e.g.:
rbarraud@thinky:~/Desktop/tools/PDFBox/trunk/app/target$ java -jar pdfbox-app-1.5.0-SNAPSHOT.jar ExtractText -html /home/rbarraud/Desktop/Reference/pdfs/IMX25RM.pdf /tmp/MX25RM.html

I want to grab tables out of PDF processor reference manuals to make tools for browsing machine state by register names etc.

Thursday, February 3, 2011

iproute2 replacing ifconfig etc. (net-tools)?

http://www.linuxfoundation.org/collaborate/workgroups/networking/iproute2
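As a memory aid, here is a small lookup from a few net-tools commands to their usual iproute2 counterparts. The function name ip_equiv is hypothetical, and the mappings are rough equivalents rather than exact ones (output formats and defaults differ):

```shell
#!/usr/bin/env bash
# ip_equiv: print the usual iproute2 counterpart of a net-tools command.
# Rough equivalents only; the output formats and defaults are not identical.
ip_equiv() {
    case "$1" in
        ifconfig) echo "ip addr show" ;;    # interface addresses
        route)    echo "ip route show" ;;   # routing table
        arp)      echo "ip neigh show" ;;   # neighbour (ARP) table
        netstat)  echo "ss" ;;              # socket statistics
        *)        echo "no mapping known for: $1" >&2; return 1 ;;
    esac
}
```

For example, `ip_equiv route` prints `ip route show`.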

Turning on bash's globstar ...


magilla@think:~/Desktop$ shopt -s globstar
magilla@think:~/Desktop$ shopt globstar
globstar on


Helps with recursive grep - you can then use

# grep -inH "find_this_expr" **/*.[CcHh]

or similar to recurse into all subdirs.
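To see what the pattern picks up, here is a throwaway sketch that builds a small tree in a temp directory and runs the same grep over it. It assumes bash 4.0 or later (where globstar was introduced); the file names are invented for the demo:

```shell
#!/usr/bin/env bash
# Sketch: build a scratch source tree and search it with a globstar pattern.
shopt -s globstar            # needs bash 4.0 or later

demo="$(mktemp -d)"          # scratch tree, removed at the end
mkdir -p "$demo/src/drivers"
printf 'int find_this_expr;\n' > "$demo/top.c"
printf 'extern int Find_This_Expr;\n' > "$demo/src/drivers/uart.h"
printf 'find_this_expr\n' > "$demo/src/notes.txt"   # wrong extension, not matched

# **/ matches zero or more directory levels, so top.c is searched as well
# as the header two levels down; -i makes the match case-insensitive.
matches="$(cd "$demo" && grep -inH "find_this_expr" **/*.[CcHh])"
printf '%s\n' "$matches"
rm -rf "$demo"
```

Without globstar set, `**` behaves like a single `*`, so the pattern would only reach files exactly one directory down.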