Форум сайта python.su
2
<td width="5%" height="42"> </td> <td width="95%" valign="top">Gentleman's ID: <a href="javascript:Show2('G123456');" class="nor"> G123456 </a> ( robert catleyday )<br> Lady's Profile ID: <a href="javascript:ShowWin2('woman/women_preview_profile.php?womanid=M123456','window','toolbar=no,location=no,directories=no,status=no,scrollbars=yes,menubar=no,resizable=yes,width=780,height=500,top=10,left=7')" class="nor"> M123456 </a> ( Анастасия - Any Briggs )</td> </tr> <tr>
g = re.findall(r'(\s\w+\s)\s+(\s\w+\s)', html)[0][0].strip().capitalize()
Офлайн
857
>>> import re >>> >>> s = """ ... <td width="5%" height="42"> </td> ... <td width="95%" valign="top">Gentleman's ID: <a href="javascript:Show2('G123456');" class="nor"> ... G123456 </a> ( ... robert catleyday )<br> ... Lady's Profile ID: <a href="javascript:ShowWin2('woman/women_preview_profile.php?womanid=M123456','window','toolbar=no,location=no,directories=no,status=no,scrollbars=yes,menubar=no,resizable=yes,width=780,height=500,top=10,left=7')" class="nor"> ... M123456 </a> ( ... Анастасия - ... Any Briggs )</td> ... </tr> ... <tr> ... """ >>> >>> pat = r' \(\s+(\S+)\s+(\S+)\s+\)' >>> >>> lst = re.findall(pat, s) >>> lst [('robert', 'catleyday')] >>>
Офлайн