packages icon
Readme file for: findgrep

Programme: findgrep

grep's in current as well as in subdirectories.
Perl reg.exp.s can be used. Supports approximate matches.

usage:

  findgrep [-lnisefcpdtBhaFSx] <reg.exp.> [<file name reg.exp.>]

Author: Lars Gregersen
        E-mail:   lg@kt.dtu.dk
        Homepage: http://www.gbar.dtu.dk/~matlg/

findgrep webpage: http://www.gbar.dtu.dk/~matlg/findgrep/

1997.10.01, version 1.00: Creation of a great new programme
1997.10.07, version 1.10: added -s option
1997.10.08, version 1.20: added -e, -f and -c options
1997.10.22, version 1.30: added -p option
1997.11.05, version 1.31: changed option -c to -b
1997.11.07, version 1.40: added -P, -d, -t and -B option
1997.12.17, version 1.41: runs nicely with Perl -w option
                          added -h option
1998.03.12, version 1.50: added -a, --adiff, --ains, --adel
                           and --asub options
                          changed the handling of the -t option
                          now uses Getopt::Long to handle options
                          env.var. FINDGREP contains default arguments
                          removed option -P
                          added option -F
1998.04.09, version 1.60: fixed bug with -i option
                          fized bug: added 'local $_' to build_matchany
                          addapted to String::Approx version 2.5
1998.07.28, version 1.61: allowed "cpp,h" when using -e option
                          will now give an error message and
                           continue when an unreadable file is met
1999.01.31, version 1.63: added option -S (show paragraph)
                          -f now handles ..
                          adapted to String::Approx version 3.05
                           (works with version 2.5+)  
1999.02.07, version 1.64: bug fixes for options -S and -s
                          -S option may now take an integer as argument
1999.04.19, version 1.70: added option -x

findgrep [-lnisefcpdtBhaFSx] <reg.exp.> [<file name reg.exp.>]

 Options:
  -l : list file names
  -n : line numbers
  -i : case insensitive
  -s : shorten long lines
  -e : treat file name reg.exp. as an extension
  -f : use directory names in file name reg.exp.
       and convert . to \. and * to .*
  -b : only search base directory
  -p : write path name with the file names
  -d : match file name reg.exp. on path name and
       file name (concatenated)
  -t : transform file names to lower case
  -B : don't search binary files
  -h : help screen
  -a : approximate search
  -F : use pattern file
  -S : show paragraph
  -x : exclude file/directory pattern


Options can be specified in the environmental variable FINDGREP just
like on the command line. Options given in the FINDGREP variable will
be overridden by the options given on the commandline.

An option can be turned off on the command line bu specifying '--no'
before the option letter. E.g. if you have specified 'FINDGREP=-n' the
numbering can be turned of on the command line using '--non'.

The options have the same effect whether specified on the command line
except for the approximate match modifiers. See approximate match for
a specification.

On DOS systems the following definition in autoexec.bat is often
useful
set FINDGREP=-t



Description of options:

 -l : list file names

List only the filenames of the files that have lines that match the
criteria. If used in combination with the -n option the lines numbers
of the first matching line is shown.


 -n : line numbers

Show line numbers of matching lines.


 -i : case insensitive

Do a case insensitive search. This makes the regular expresions case
insensitive for both the search pattern and the path and file name
pattern.


 -s : shorten long lines

Show only a few lines of text around the place where some matching
text is found. This feature is useful when searching binary files
that do not have "standard" end of line chracters.


 -e : treat file name reg.exp. as an extension

Use the file name specification as an extension (suffix) instead of a
regular expression. That means that "cpp" will be tranferred into
"\.cpp$". You may specify more than one extension by using "(h|cpp)",
which will search files that have either "h" or "cpp" as extension.
When used with the -f option the last part of the file name
specification will be used as the extension.


 -f : use directory names in file name reg.exp.
      and convert . to \. and * to .*

Make it possible to specify the base directory for the recursive
search. The remeaning part of the file name specification will have
the characters "." and "*" translated into "\." and ".*" respectively
and in that order. This makes it easier to specify file names in a
'glob' kind of way. But remember that regular expressions are still
used.

  Examples:

Search for the pattern "foo" using bar as base directory in all files that matches the pattern "filename"
  findgrep -f foo bar/filename

Search for the pattern "foo" in files that start with "bar" and
has the extension "h", but note that a file called "foobar.h.bak"
will also be found by this search criterion. The search criterion
used is "bar.*\.h".
  findgrep -f foo "bar*.h"

Search for the pattern "foo" in a file that has "bar" in it followed by a number somewhere using the directory "tmpdir" as the base directory.
  findgrep -f foo "tmpdir/bar*\d"

Search for the pattern "foo" in files that matches the pattern bar using the previous directory as base directory.
  findgrep -f foo ../bar


 -b : only search base directory

Only search the base directory. This makes findgrep work almost as
grep.


 -p : write path name with the file names instead of
      in seperate lines

The normal output of findgrep is to first specify the directory where
the search is made and then to write the file names without directory
names. The -p option changes this so the directory names are written
with the files.


 -d : match file name reg.exp. on path name and
      file name (concatenated)

Instead of just matching the file name the file name reg.exp. must
match the path name and the file name when they are concatenated

Example:

search for "foo" when the path name and file name will match
"bar.*foo". Will e.g. match "bar/somedir/foo".
  findgrep -d foo "bar.*foo"

searh for "foo" using the "dir1" as base directory and looking for
files which have a path name (relative to the current file
directory) and file name that matches "bar*txt". It will e.g. match
"dir1/dir2/bar/readme.txt"
  findgrep -fd foo "dir1/bar*txt"


 -t : transform file names to lower case

File names and path names are transformed to lower case before using
in tests and when printing. The case is preserved when opening files.
Useful on file systems that do not distinguish between (but allow)
upper and lower case (like Windows 96/NT).


 -B : don't search binary files

Don't search files that appear to be binary.


 -h : help screen

Print a help screen with the most commonly used options.


 -a : approximate search

 Search for approximate match. This is good for searching for
 misspelled
 information and when you don't know the precise spelling of words.
 Warning: the approximate search takes much longer time than the
 normal
 search. Note that String::Approx must be installed for this to work.

 --adiff, --ains, --adel --asub

Approximate is defined here as k-differences.  One difference
is an insertion, a deletion, or a substitution of one character.
The k in the k-differences is the maximum number of differences. The
default is to allow for one difference. This can be changed using the
modifiers above.

Approximate matching will automatically be used when any of the
modifiers are specified on the command line. This means that the -a
option can be left out. If the modifiers are specified in the FINDGREP
environmental variable the approximate match is not used unless the -a
option is specified. This feature allow one to specify defaults of the
modifers.

 Examples

Search for 'Tschaikowskij' with one difference
  findgrep -ea Tschaikowskij

Search for 'Tschaikowskij' with one substitution and no deletions or
insertions
  findgrep -e --adiff=0 --asub=1 'Tschaikowskij'

FINDGREP contains '--adiff=2' and we want to search for
'Tschaikowskij' with 1 insertion, 2 deletions and 2 substitutions
  findgrep --ains=1 Tschaikowskij


 -F : use pattern file

Use the pattern in the specified pattern file for searches. A regular 
expression cannot be specified on the command line if this option is 
used. The pattern in the pattern file are separated by newlines.

 Example
The file 'pattern.ptn' contains:
red
blue
green

Use findgrep to find these words in text files:
  findgrep -e -Fpattern.ptn txt


  -S : show paragraph
  
The standard output of findgrep is a single line where the matching 
text if found. This is out of context with the rest of the text/file.
By using the option -S the entire paragraph where the matching text is 
found will be displayed. findgrep uses an empty line to determine where
paragraphs start and end. Note that this option can create a lot of
output if paragraphs are long. This option can be combined with the -s option.

If used in combination with the -n parameter the line number of the first matching line is shown with the file name (note that there can be more than more match in a paragraph); the line number of the lines in the paragraph will be shown as well.

When the -S option is used the number of lines in a paragraph is limited to a default of 100 lines. This number can be changed using the option
  -S=<number>

 Example
A search for the word "foo" showing only entire paragraphs when the paragraph is smaller than 25 lines.
  findgrep -S=25 foo
If paragraphs larger than 25 lines is encountered only the matching line is shown.


  -x : exclude file/directory pattern

Use this option to exclude files and/or directories that is otherwise included. File names and directories must first positively match their respectively specified patterns before this pattern is checked.

 Example
Search for "foo" in .ini files, but exclude files that contain "system"
  findgrep -e -x="system" foo ini
Note that this search expression will search for files in directories containing "system".

Search for "foo" in .ini files, but exclude directories that contain "system"
  findgrep -ed -x="/system/" foo ini
Note that this search expression will search for files that contain the phrase "system".

Search for "foo" in .ini files, but exclude paths that contain "system". I.e., do not contain "system" in the directory name or file name.
  findgrep -ed -x="system" foo ini