Issue
From my dataframe column, I want to remove the "-" character and the letters following it.
For eg.
Input dataframe
Name Subset
Apple AP-, GP-
Bat BT-01A, KL
Cat CT-L, OK-01
Output desired
Name Subset
Apple AP,GP
Bat BT,KL
Cat CT,OK
Solution
You can use -[^,]*
to match everything from -
till a comma, here we use negation [^,]
to match a non comma character:
df['Subset'] = df.Subset.str.replace('-[^,]*', '', regex=True)
df
Name Subset
0 Apple AP, GP
1 Bat BT, KL
2 Cat CT, OK
Answered By – Psidom
Answer Checked By – Dawn Plyler (BugsFixing Volunteer)