I am trying to extract wanted text from a given set of text. I have created below function.
def extract_name(title):
matches = re.findall(r'\b[A-Z0-9\s&.,()-]+(?:\s*\(\d\))?\b', title)
return ', '.join(matches) if matches else None
But, it produces unwanted (, ,
for some titles. For example, my title are like below.
THETA COMMERCIALS (2005) LIMITED, TEST CONNECTIONS LTD (In Relation), NANO CARE LIMITED (In Relation)
Expected outcome: THETA COMMERCIALS (2005) LIMITED, TEST CONNECTIONS LTD, NANO CARE LIMITED