Вот исходный код страницы: view-source:
www.pythonscraping.com/pages/page3.html
Вот мой код:
from urllib.request import urlopen
from urllib.error import HTTPError
from bs4 import BeautifulSoup as bs
from pprint import pprint
html_page = urlopen("http://www.pythonscraping.com/pages/page3.html")
soup = bs(html_page.read(), "html.parser")
for i in soup.find('table', {'id':'giftList'}).children:
print(i)
Я считаю, что должны выводится все теги tr, без содержимого td, и другие теги лежащие на одном уровне с tr. Но выводится это:
<tr>
<th>
Item Title
</th>
<th>
Description
</th>
<th>
Cost
</th>
<th>
Image
</th>
</tr>
<tr class="gift" id="gift1">
<td>
Vegetable Basket
</td>
<td>
This vegetable basket is the perfect gift for your health conscious (or overweight) friends!
<span class="excitingNote">Now with super-colorful bell peppers!</span>
</td>
<td>
$15.00
</td>
<td>
<img src="../img/gifts/img1.jpg" />
</td>
</tr>
<tr class="gift" id="gift2">
<td>
Russian Nesting Dolls
</td>
<td>
Hand-painted by trained monkeys, these exquisite dolls are priceless! And by "priceless," we mean "extremely expensive"! <span class="excitingNote">8 entire dolls per set! Octuple the presents!</span>
</td>
<td>
$10,000.52
</td>
<td>
<img src="../img/gifts/img2.jpg" />
</td>
</tr>
<tr class="gift" id="gift3">
<td>
Fish Painting
</td>
<td>
If something seems fishy about this painting, it's because it's a fish! <span class="excitingNote">Also hand-painted by trained monkeys!</span>
</td>
<td>
$10,005.00
</td>
<td>
<img src="../img/gifts/img3.jpg" />
</td>
</tr>
<tr class="gift" id="gift4">
<td>
Dead Parrot
</td>
<td>
This is an ex-parrot! <span class="excitingNote">Or maybe he's only resting?</span>
</td>
<td>
$0.50
</td>
<td>
<img src="../img/gifts/img4.jpg" />
</td>
</tr>
<tr class="gift" id="gift5">
<td>
Mystery Box
</td>
<td>
If you love suprises, this mystery box is for you! Do not place on light-colored surfaces. May cause oil staining. <span class="excitingNote">Keep your friends guessing!</span>
</td>
<td>
$1.50
</td>
<td>
<img src="../img/gifts/img6.jpg" />
</td>
</tr>
А если поменять children на descendants, то выводится это:
<tr>
<th>
Item Title
</th>
<th>
Description
</th>
<th>
Cost
</th>
<th>
Image
</th>
</tr>
<th>
Item Title
</th>
Item Title
<th>
Description
</th>
Description
<th>
Cost
</th>
Cost
<th>
Image
</th>
Image
<tr class="gift" id="gift1">
<td>
Vegetable Basket
</td>
<td>
This vegetable basket is the perfect gift for your health conscious (or overweight) friends!
<span class="excitingNote">Now with super-colorful bell peppers!</span>
</td>
<td>
$15.00
</td>
<td>
<img src="../img/gifts/img1.jpg" />
</td>
</tr>
<td>
Vegetable Basket
</td>
Vegetable Basket
<td>
This vegetable basket is the perfect gift for your health conscious (or overweight) friends!
<span class="excitingNote">Now with super-colorful bell peppers!</span>
</td>
This vegetable basket is the perfect gift for your health conscious (or overweight) friends!
<span class="excitingNote">Now with super-colorful bell peppers!</span>
Now with super-colorful bell peppers!
<td>
$15.00
</td>
$15.00
<td>
<img src="../img/gifts/img1.jpg" />
</td>
<img src="../img/gifts/img1.jpg" />
<tr class="gift" id="gift2">
<td>
Russian Nesting Dolls
</td>
<td>
Hand-painted by trained monkeys, these exquisite dolls are priceless! And by "priceless," we mean "extremely expensive"! <span class="excitingNote">8 entire dolls per set! Octuple the presents!</span>
</td>
<td>
$10,000.52
</td>
<td>
<img src="../img/gifts/img2.jpg" />
</td>
</tr>
<td>
Russian Nesting Dolls
</td>
Russian Nesting Dolls
<td>
Hand-painted by trained monkeys, these exquisite dolls are priceless! And by "priceless," we mean "extremely expensive"! <span class="excitingNote">8 entire dolls per set! Octuple the presents!</span>
</td>
Hand-painted by trained monkeys, these exquisite dolls are priceless! And by "priceless," we mean "extremely expensive"!
<span class="excitingNote">8 entire dolls per set! Octuple the presents!</span>
8 entire dolls per set! Octuple the presents!
<td>
$10,000.52
</td>
$10,000.52
<td>
<img src="../img/gifts/img2.jpg" />
</td>
<img src="../img/gifts/img2.jpg" />
<tr class="gift" id="gift3">
<td>
Fish Painting
</td>
<td>
If something seems fishy about this painting, it's because it's a fish! <span class="excitingNote">Also hand-painted by trained monkeys!</span>
</td>
<td>
$10,005.00
</td>
<td>
<img src="../img/gifts/img3.jpg" />
</td>
</tr>
<td>
Fish Painting
</td>
Fish Painting
<td>
If something seems fishy about this painting, it's because it's a fish! <span class="excitingNote">Also hand-painted by trained monkeys!</span>
</td>
If something seems fishy about this painting, it's because it's a fish!
<span class="excitingNote">Also hand-painted by trained monkeys!</span>
Also hand-painted by trained monkeys!
<td>
$10,005.00
</td>
$10,005.00
<td>
<img src="../img/gifts/img3.jpg" />
</td>
<img src="../img/gifts/img3.jpg" />
<tr class="gift" id="gift4">
<td>
Dead Parrot
</td>
<td>
This is an ex-parrot! <span class="excitingNote">Or maybe he's only resting?</span>
</td>
<td>
$0.50
</td>
<td>
<img src="../img/gifts/img4.jpg" />
</td>
</tr>
<td>
Dead Parrot
</td>
Dead Parrot
<td>
This is an ex-parrot! <span class="excitingNote">Or maybe he's only resting?</span>
</td>
This is an ex-parrot!
<span class="excitingNote">Or maybe he's only resting?</span>
Or maybe he's only resting?
<td>
$0.50
</td>
$0.50
<td>
<img src="../img/gifts/img4.jpg" />
</td>
<img src="../img/gifts/img4.jpg" />
<tr class="gift" id="gift5">
<td>
Mystery Box
</td>
<td>
If you love suprises, this mystery box is for you! Do not place on light-colored surfaces. May cause oil staining. <span class="excitingNote">Keep your friends guessing!</span>
</td>
<td>
$1.50
</td>
<td>
<img src="../img/gifts/img6.jpg" />
</td>
</tr>
<td>
Mystery Box
</td>
Mystery Box
<td>
If you love suprises, this mystery box is for you! Do not place on light-colored surfaces. May cause oil staining. <span class="excitingNote">Keep your friends guessing!</span>
</td>
If you love suprises, this mystery box is for you! Do not place on light-colored surfaces. May cause oil staining.
<span class="excitingNote">Keep your friends guessing!</span>
Keep your friends guessing!
<td>
$1.50
</td>
$1.50
<td>
<img src="../img/gifts/img6.jpg" />
</td>
<img src="../img/gifts/img6.jpg" />
Почему children работает не так, как я говорил выше?