Re: [nottingham] Hassle with perl script

From: Matthew Sackman (matthew@sackman.co.uk)
Date: Wed 09 Jan 2002 - 20:20:01 GMT


On Wed, Jan 09, 2002 at 02:37:13PM +0000, Paul Sladen wrote:
> On 9 Jan 2002, mike wrote:
> > On Tue, 2002-01-08 at 18:11, Paul Sladen wrote:
> > >
> > > Crumps... it's 4am in the morning... nowt good.
> >
> > Me again - solved most of it in perl except for one annoyance - for the
> > life of me I can't match > in the HTML - tags on the other side are fine,
> > it just won't let me get rid of it
>
> [<][Aa] +[^>]*href="([^"]+)"[^>]*[>]
>
> Or some such might be the way to get around it; you'll need to split
> multiple <a> tags on one line too.

There are many ways of doing this, and many Perl modules suited to it.
For this kind of work I have come across a book called "Data Munging with
Perl", which is simply excellent and makes a boring topic embarrassingly
interesting! :-)

For those interested, it's normally about 30ukp, written by David Cross,
published by Manning, ISBN 1-930110-00-6

It really is a good resource for this kind of thing.
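
As a rough illustration of the "use a parser rather than a regex" route
mentioned above, here is a minimal sketch. It is written in Python with only
the standard library, not Perl, purely so it is self-contained. The names
HrefCollector and extract_hrefs and the sample HTML are made up for the
example; this is not what anyone in the thread actually ran.

```python
from html.parser import HTMLParser


class HrefCollector(HTMLParser):
    """Collect the href value of every <a> tag (hypothetical helper)."""

    def __init__(self):
        super().__init__()
        self.hrefs = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser lower-cases tag and attribute names,
        # so <A HREF="..."> is handled the same as <a href="...">.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value is not None:
                self.hrefs.append(value)


def extract_hrefs(html):
    """Return the href values of all <a> tags in the given HTML text."""
    parser = HrefCollector()
    parser.feed(html)
    parser.close()
    return parser.hrefs


# Made-up sample with two links on one line and mixed-case markup.
sample = '<p><a href="http://www.lug.org.uk">LUG</a> and <A class="x" HREF="/jobs">jobs</A></p>'
print(extract_hrefs(sample))
```

Because the parser tracks tag boundaries itself, several <a> tags on one
line need no special splitting, and the closing > never has to be matched
by hand.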

Matthew

-- 

Matthew Sackman Nottingham England

BOFH Excuse Board: I'd love to help you -- it's just that the Boss won't let me near the computer.




