Monday, February 14, 2011

Sunday, February 13, 2011

PDF (Tables) to Text more

pdftotext (1) - Portable Document Format (PDF) to text converter (version 3.00)

pdftk - useful tool for manipulating PDF documents


There are a number of PDF-to-Excel things available commercially ... one (Nitro) has a demo online but limits filesize input to 10MB :(


devmem2.c, access device registers from userspace Linux

Builds in target machine (or can be cross-compiled)

Run it without params for usage information.

Addresses are Physical.

Prefix with 0x for hex addresses.

This can be wrapped in a script to give multiple register values read out.

Extracting text from PDF file with PDFBox ExtractText command

Needs some installation (assuming PDFbox isn't installed already e.g. by your Linux distribution):

SVN pdfbox trunk

apt-get install maven2

mvn clean install

On successful building it, use it like:

usage: java -jar pdfbox-app-x.y.z.jar org.apache.pdfbox.ExtractText [OPTIONS] [Text file]

rbarraud@thinky:~/Desktop/tools/PDFBox/trunk/app/target$ java -jar pdfbox-app-1.5.0-SNAPSHOT.jar ExtractText -html /home/rbarraud/Desktop/Reference/pdfs/IMX25RM.pdf /tmp/MX25RM.html

I want to grab tables out of PDF processor reference manuals to make tools for browsing machine state by register names etc.

Thursday, February 3, 2011

iproute2 replacing ifconfig etc (net-tools) ?

Turning on bash's globstar ...

magilla@think:~/Desktop$ shopt -s globstar
magilla@think:~/Desktop$ shopt globstar
globstar on

Helps with recursive grep - you can then use

# grep -inH "find_this_expr" **/*.[CcHh]

or similar to recurse into all subdirs.