Sometimes, we want to extract part of a regex match with Python.
In this article, we’ll look at how to extract part of a regex match with Python.
How to extract part of a regex match with Python?
To extract part of a regex match with Python, we can use the re.search method with a regex that gets a match from within a pattern.
For instance, we write:
import re
html = '<title>hell world</title>'
title_search = re.search('<title>(.*)</title>', html, re.IGNORECASE)
if title_search:
title = title_search.group(1)
print(title)
We want to extract the text between the title tags in html.
To do that, we call re.search with '<title>(.*)</title> to get the content between the title tags.
Then we pass in html and re.IGNORECASE as the other arguments to search html in a case-insensitive manner.
Then we get the match from the regex group with title_search.group(1).
Therefore, title should be 'hello world'.
Conclusion
To extract part of a regex match with Python, we can use the re.search method with a regex that gets a match from within a pattern.