Step 2b
Filtering Out the Noise with Regex
Extracting attribute values
<\/strong>([^<]*)(?:<[^>]*>|$)
-
match the character "<" literally
-
match the character "/" literally
-
match the characters "strong>" literally
-
match the regular expression below and capture its match into backreference number 1
-
match any character that is not a "<"
-
between zero and unlimited times, as many times as possible, giving back as needed (greedy)
-
match the regular expression below
-
match either the regular expression below (attempting the next alternative only if this one fails)
-
match the character "<" literally
-
match any character that is not a ">"
-
between zero and unlimited times, as many times as possible, giving back as needed (greedy)
-
match the character ">" literally
-
or match regular expression number 2 below (the entire group fails if this one fails to match)
-
assert position at the end of the string (or before the line break at the end of the string, if any)
← Back to post