DjangoBB LoFi version

Начало » Python для новичков » XPath вопрос на примере

agryn

Дек. 29, 2012 00:52:39

Есть пример html-кода

<html>
    <div class="myclass">
        <p>
            some html code
        </p>
        <br>
            some html code 2       
    </div>
</html>

если прописать

 '//div[@class="myclass"]'

то извлекаться

    <div class="myclass">
        <p>
            some html code
        </p>
        <br>
            some html code 2       
    </div>

а как извлечь?

        <p>
            some html code
        </p>
        <br>
            some html code 2

mironich

Дек. 29, 2012 04:48:17

 '//div[@class="myclass"]/p' #Для some html code

        <p>
            some html code
        </p>
        <br>
            some html code 2

У библиотеки lxml есть у обьектов(HTMLElement) атрибут text.

py.user.next

Дек. 29, 2012 11:51:42

>>> s = """
... <html>
...     <div class="myclass">
...         <p>
...             some html code
...         </p>
...         <br>
...             some html code 2       
...     </div>
... </html>
... """
>>> 
>>> root = lxml.html.fromstring(s)
>>> elems = root.xpath('//div[@class="myclass"]/*')
>>> for i in elems:
...     print('<{0}> <{1}>'.format(i.tag, i.text))
... 
<p> <
            some html code
        >
<br> <None>
>>>