Refind returning strange results
I'm having to read the <title> and <h1> contents of a group of web pages in order to insert them into a database. My problem is that some of the H1 tags have attributes (like class= or align=). I'm using the following code to pull the H1:
<cfset regex1 = "<h1[^>]*>(.+?)</h1>">
Then feeding that into a refindnocase().
But the result I'm getting isn't the tag contents, but the attribute itself. How can I make the refind ignore the attributes of the H1 and just return the tag contents? The code works great if there are no attributes, but not when there are.
The odd part is that I always thought with a regex, only the things in (parens) were saved for later use. I don't get it.
