On Wed, Jan 09, 2002 at 02:37:13PM +0000, Paul Sladen wrote:
> On 9 Jan 2002, mike wrote:
> > On Tue, 2002-01-08 at 18:11, Paul Sladen wrote:
> > >
> > > Crumps... it's 4am in the morning... nowt good.
> >
> > Me again - solved most of it in perl except for one annoyance - for the
> > life of me I cant match > int he HTML - taags on the other side are fine
> > just wont let me get rid of it
>
> [<][Aa] +[^>]href="([^"]+)"[^>][>]
>
> Or some such might be the way to get around it; you'll need to split
> multiple <a> tags on one line two.
There are many ways of doing this, and many Perl modules suitable for this.
For this kind of work I have come across a book called "Data Munging with
Perl" which is simply excellant and makes a boring topic embarassingly
interesting! :-)
For those interested, it's normally about 30ukp, written by David Cross,
published by Manning, ISBN 1-930110-00-6
It really is a good resource for this kind of thing.
Matthew
--Matthew Sackman Nottingham England
BOFH Excuse Board: I'd love to help you -- it's just that the Boss won't let me near the computer.
This archive was generated by hypermail 2.1.3 : Wed 09 Jan 2002 - 20:20:27 GMT