python - Storing the unknown Id of an html tag -

so trying scrape html using beautifulsoup, having problems finding tag id using python 3.4. know tag ("tr") is, id changing , save id when changes. example:

<div class = "thisclass"   <table id = "thistable">     <tbody>       <tr id="what want">         <td class = "someinfo">    <tbody>              <table> <div>

i can find div tag , table, , know tr tag there, want extract text next id, without knowing text going say.

so far have code:

soup = beautifulsoup(url.read()) divtag = soup.find_all("table",id ="thistable") = 0 in divtag:   trtag = soup.find("tr", id)    print(trtag)       = i+1

if help me solve problem appreciate it.

you can utilize css selector:

print([element.get('id') element in soup.select('table#thistable tr[id]'))

python html tags web-scraping beautifulsoup

Search This Blog

Three

python - Storing the unknown Id of an html tag -

Comments

Post a Comment

Popular posts from this blog

php - Android app custom user registration and login with cookie using facebook sdk -

ruby on rails - Devise Logout Error in RoR -

c# - Create a Notification Object (Email or Page) At Run Time -- Dependency Injection or Factory -