Div text not showing up with Selenium Python2019 Community Moderator ElectionCalling an external command in PythonWhat are metaclasses in Python?How to horizontally center a <div>?Finding the index of an item given a list containing it in PythonDifference between append vs. extend list methods in PythonHow can I safely create a nested directory in Python?Does Python have a ternary conditional operator?How to get the current time in PythonHow can I make a time delay in Python?Does Python have a string 'contains' substring method?
What does Jesus mean regarding "Raca," and "you fool?" - is he contrasting them?
If "dar" means "to give", what does "daros" mean?
Help prove this basic trig identity please!
What exactly term 'companion plants' means?
Does the attack bonus from a Masterwork weapon stack with the attack bonus from Masterwork ammunition?
Am I eligible for the Eurail Youth pass? I am 27.5 years old
Violin - Can double stops be played when the strings are not next to each other?
How to get the n-th line after a grepped one?
Could Sinn Fein swing any Brexit vote in Parliament?
Print a physical multiplication table
Pronounciation of the combination "st" in spanish accents
Bash - pair each line of file
Hausdorff dimension of the boundary of fibres of Lipschitz maps
Practical application of matrices and determinants
How does 取材で訪れた integrate into this sentence?
What (if any) is the reason to buy in small local stores?
Recruiter wants very extensive technical details about all of my previous work
Asserting that Atheism and Theism are both faith based positions
Is honey really a supersaturated solution? Does heating to un-crystalize redissolve it or melt it?
Loading the leaflet Map in Lightning Web Component
Why is there so much iron?
How can an organ that provides biological immortality be unable to regenerate?
What does Deadpool mean by "left the house in that shirt"?
How are passwords stolen from companies if they only store hashes?
Div text not showing up with Selenium Python
2019 Community Moderator ElectionCalling an external command in PythonWhat are metaclasses in Python?How to horizontally center a <div>?Finding the index of an item given a list containing it in PythonDifference between append vs. extend list methods in PythonHow can I safely create a nested directory in Python?Does Python have a ternary conditional operator?How to get the current time in PythonHow can I make a time delay in Python?Does Python have a string 'contains' substring method?
I"m putting together a webscraper to collect locational data for Goodwill stores based on a list of zipcodes. I have done this multiple times in the past for other stores but Goodwill's website seems to be a little different. Here is how the div that I'm trying to scrape is set up.
<div class="contact">4300 W 36 1/2 St<br>St Louis Park, MN 55416<br><div
class="phone">(952) 922-9640</div><a onclick="ga('send', 'event',
'Locator', 'Clicked Location Website Link', 'http://www.seconddebut.org');"
class="website" href="http://www.seconddebut.org">Visit Website</a></div>
From this div I want to scrape the street address, city, state, & zipcode. I have tried this code
htmlSource = driver.page_source
soup = BeautifulSoup(htmlSource, 'html.parser')
stores = soup.find("div", attrs="class":"contact")
for store in stores:
print store.get_text()
I have also tried this
soup = BeautifulSoup(htmlSource, 'html.parser')
stores = soup.find("div", attrs="class":"contact")
children = stores.findChildren("br", recursive=False)
for child in children:
print child
Neither of these options have worked for me yet. Any help would be greatly appreciated!
python html selenium-webdriver
add a comment |
I"m putting together a webscraper to collect locational data for Goodwill stores based on a list of zipcodes. I have done this multiple times in the past for other stores but Goodwill's website seems to be a little different. Here is how the div that I'm trying to scrape is set up.
<div class="contact">4300 W 36 1/2 St<br>St Louis Park, MN 55416<br><div
class="phone">(952) 922-9640</div><a onclick="ga('send', 'event',
'Locator', 'Clicked Location Website Link', 'http://www.seconddebut.org');"
class="website" href="http://www.seconddebut.org">Visit Website</a></div>
From this div I want to scrape the street address, city, state, & zipcode. I have tried this code
htmlSource = driver.page_source
soup = BeautifulSoup(htmlSource, 'html.parser')
stores = soup.find("div", attrs="class":"contact")
for store in stores:
print store.get_text()
I have also tried this
soup = BeautifulSoup(htmlSource, 'html.parser')
stores = soup.find("div", attrs="class":"contact")
children = stores.findChildren("br", recursive=False)
for child in children:
print child
Neither of these options have worked for me yet. Any help would be greatly appreciated!
python html selenium-webdriver
Is this from goodwilleasterseals.org/shop/find-store? The HTML there doesn't look like this. Since it seems to be a public URL, can you post it?
– JeffC
Mar 7 at 17:30
Jeff, I've been using this url goodwill.org/locator but it looks like there are the same locations on the url that you've prodived as that one that I've been using. I'm going to try yours
– jnord
Mar 7 at 18:12
I still don't see the HTML that you posted. Can you edit your question and put in the step by step instructions of how you would manually get to the results page you are trying to scrape?
– JeffC
Mar 8 at 1:19
add a comment |
I"m putting together a webscraper to collect locational data for Goodwill stores based on a list of zipcodes. I have done this multiple times in the past for other stores but Goodwill's website seems to be a little different. Here is how the div that I'm trying to scrape is set up.
<div class="contact">4300 W 36 1/2 St<br>St Louis Park, MN 55416<br><div
class="phone">(952) 922-9640</div><a onclick="ga('send', 'event',
'Locator', 'Clicked Location Website Link', 'http://www.seconddebut.org');"
class="website" href="http://www.seconddebut.org">Visit Website</a></div>
From this div I want to scrape the street address, city, state, & zipcode. I have tried this code
htmlSource = driver.page_source
soup = BeautifulSoup(htmlSource, 'html.parser')
stores = soup.find("div", attrs="class":"contact")
for store in stores:
print store.get_text()
I have also tried this
soup = BeautifulSoup(htmlSource, 'html.parser')
stores = soup.find("div", attrs="class":"contact")
children = stores.findChildren("br", recursive=False)
for child in children:
print child
Neither of these options have worked for me yet. Any help would be greatly appreciated!
python html selenium-webdriver
I"m putting together a webscraper to collect locational data for Goodwill stores based on a list of zipcodes. I have done this multiple times in the past for other stores but Goodwill's website seems to be a little different. Here is how the div that I'm trying to scrape is set up.
<div class="contact">4300 W 36 1/2 St<br>St Louis Park, MN 55416<br><div
class="phone">(952) 922-9640</div><a onclick="ga('send', 'event',
'Locator', 'Clicked Location Website Link', 'http://www.seconddebut.org');"
class="website" href="http://www.seconddebut.org">Visit Website</a></div>
From this div I want to scrape the street address, city, state, & zipcode. I have tried this code
htmlSource = driver.page_source
soup = BeautifulSoup(htmlSource, 'html.parser')
stores = soup.find("div", attrs="class":"contact")
for store in stores:
print store.get_text()
I have also tried this
soup = BeautifulSoup(htmlSource, 'html.parser')
stores = soup.find("div", attrs="class":"contact")
children = stores.findChildren("br", recursive=False)
for child in children:
print child
Neither of these options have worked for me yet. Any help would be greatly appreciated!
python html selenium-webdriver
python html selenium-webdriver
asked Mar 7 at 17:02
jnordjnord
185
185
Is this from goodwilleasterseals.org/shop/find-store? The HTML there doesn't look like this. Since it seems to be a public URL, can you post it?
– JeffC
Mar 7 at 17:30
Jeff, I've been using this url goodwill.org/locator but it looks like there are the same locations on the url that you've prodived as that one that I've been using. I'm going to try yours
– jnord
Mar 7 at 18:12
I still don't see the HTML that you posted. Can you edit your question and put in the step by step instructions of how you would manually get to the results page you are trying to scrape?
– JeffC
Mar 8 at 1:19
add a comment |
Is this from goodwilleasterseals.org/shop/find-store? The HTML there doesn't look like this. Since it seems to be a public URL, can you post it?
– JeffC
Mar 7 at 17:30
Jeff, I've been using this url goodwill.org/locator but it looks like there are the same locations on the url that you've prodived as that one that I've been using. I'm going to try yours
– jnord
Mar 7 at 18:12
I still don't see the HTML that you posted. Can you edit your question and put in the step by step instructions of how you would manually get to the results page you are trying to scrape?
– JeffC
Mar 8 at 1:19
Is this from goodwilleasterseals.org/shop/find-store? The HTML there doesn't look like this. Since it seems to be a public URL, can you post it?
– JeffC
Mar 7 at 17:30
Is this from goodwilleasterseals.org/shop/find-store? The HTML there doesn't look like this. Since it seems to be a public URL, can you post it?
– JeffC
Mar 7 at 17:30
Jeff, I've been using this url goodwill.org/locator but it looks like there are the same locations on the url that you've prodived as that one that I've been using. I'm going to try yours
– jnord
Mar 7 at 18:12
Jeff, I've been using this url goodwill.org/locator but it looks like there are the same locations on the url that you've prodived as that one that I've been using. I'm going to try yours
– jnord
Mar 7 at 18:12
I still don't see the HTML that you posted. Can you edit your question and put in the step by step instructions of how you would manually get to the results page you are trying to scrape?
– JeffC
Mar 8 at 1:19
I still don't see the HTML that you posted. Can you edit your question and put in the step by step instructions of how you would manually get to the results page you are trying to scrape?
– JeffC
Mar 8 at 1:19
add a comment |
2 Answers
2
active
oldest
votes
Try selenium
webrdriver.find_element_by_ccs_selector('selector path').text
I tried this earlier, didn't work either :/
– jnord
Mar 7 at 17:15
Have you tried different browsers with selenium? Sometimes this help. Check phantomjs too
– MTG
Mar 7 at 18:37
add a comment |
Assuming they all follow the same pattern, something like the below should work:
from bs4 import BeautifulSoup
markup = r"""
<div class="contact">4300 W 36 1/2 St<br>St Louis Park, MN 55416<br><div
class="phone">(952) 922-9640</div><a onclick="ga('send', 'event',
'Locator', 'Clicked Location Website Link', 'http://www.seconddebut.org');"
class="website" href="http://www.seconddebut.org">Visit Website</a></div>
"""
soup = BeautifulSoup(markup, "html.parser")
store = soup.find("div", attrs="class": "contact")
print(list(store.strings)[:2])
Result:
['4300 W 36 1/2 St', 'St Louis Park, MN 55416']
add a comment |
Your Answer
StackExchange.ifUsing("editor", function ()
StackExchange.using("externalEditor", function ()
StackExchange.using("snippets", function ()
StackExchange.snippets.init();
);
);
, "code-snippets");
StackExchange.ready(function()
var channelOptions =
tags: "".split(" "),
id: "1"
;
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function()
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled)
StackExchange.using("snippets", function()
createEditor();
);
else
createEditor();
);
function createEditor()
StackExchange.prepareEditor(
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: true,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: 10,
bindNavPrevention: true,
postfix: "",
imageUploader:
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
,
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
);
);
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55049229%2fdiv-text-not-showing-up-with-selenium-python%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
2 Answers
2
active
oldest
votes
2 Answers
2
active
oldest
votes
active
oldest
votes
active
oldest
votes
Try selenium
webrdriver.find_element_by_ccs_selector('selector path').text
I tried this earlier, didn't work either :/
– jnord
Mar 7 at 17:15
Have you tried different browsers with selenium? Sometimes this help. Check phantomjs too
– MTG
Mar 7 at 18:37
add a comment |
Try selenium
webrdriver.find_element_by_ccs_selector('selector path').text
I tried this earlier, didn't work either :/
– jnord
Mar 7 at 17:15
Have you tried different browsers with selenium? Sometimes this help. Check phantomjs too
– MTG
Mar 7 at 18:37
add a comment |
Try selenium
webrdriver.find_element_by_ccs_selector('selector path').text
Try selenium
webrdriver.find_element_by_ccs_selector('selector path').text
answered Mar 7 at 17:08
MTGMTG
47110
47110
I tried this earlier, didn't work either :/
– jnord
Mar 7 at 17:15
Have you tried different browsers with selenium? Sometimes this help. Check phantomjs too
– MTG
Mar 7 at 18:37
add a comment |
I tried this earlier, didn't work either :/
– jnord
Mar 7 at 17:15
Have you tried different browsers with selenium? Sometimes this help. Check phantomjs too
– MTG
Mar 7 at 18:37
I tried this earlier, didn't work either :/
– jnord
Mar 7 at 17:15
I tried this earlier, didn't work either :/
– jnord
Mar 7 at 17:15
Have you tried different browsers with selenium? Sometimes this help. Check phantomjs too
– MTG
Mar 7 at 18:37
Have you tried different browsers with selenium? Sometimes this help. Check phantomjs too
– MTG
Mar 7 at 18:37
add a comment |
Assuming they all follow the same pattern, something like the below should work:
from bs4 import BeautifulSoup
markup = r"""
<div class="contact">4300 W 36 1/2 St<br>St Louis Park, MN 55416<br><div
class="phone">(952) 922-9640</div><a onclick="ga('send', 'event',
'Locator', 'Clicked Location Website Link', 'http://www.seconddebut.org');"
class="website" href="http://www.seconddebut.org">Visit Website</a></div>
"""
soup = BeautifulSoup(markup, "html.parser")
store = soup.find("div", attrs="class": "contact")
print(list(store.strings)[:2])
Result:
['4300 W 36 1/2 St', 'St Louis Park, MN 55416']
add a comment |
Assuming they all follow the same pattern, something like the below should work:
from bs4 import BeautifulSoup
markup = r"""
<div class="contact">4300 W 36 1/2 St<br>St Louis Park, MN 55416<br><div
class="phone">(952) 922-9640</div><a onclick="ga('send', 'event',
'Locator', 'Clicked Location Website Link', 'http://www.seconddebut.org');"
class="website" href="http://www.seconddebut.org">Visit Website</a></div>
"""
soup = BeautifulSoup(markup, "html.parser")
store = soup.find("div", attrs="class": "contact")
print(list(store.strings)[:2])
Result:
['4300 W 36 1/2 St', 'St Louis Park, MN 55416']
add a comment |
Assuming they all follow the same pattern, something like the below should work:
from bs4 import BeautifulSoup
markup = r"""
<div class="contact">4300 W 36 1/2 St<br>St Louis Park, MN 55416<br><div
class="phone">(952) 922-9640</div><a onclick="ga('send', 'event',
'Locator', 'Clicked Location Website Link', 'http://www.seconddebut.org');"
class="website" href="http://www.seconddebut.org">Visit Website</a></div>
"""
soup = BeautifulSoup(markup, "html.parser")
store = soup.find("div", attrs="class": "contact")
print(list(store.strings)[:2])
Result:
['4300 W 36 1/2 St', 'St Louis Park, MN 55416']
Assuming they all follow the same pattern, something like the below should work:
from bs4 import BeautifulSoup
markup = r"""
<div class="contact">4300 W 36 1/2 St<br>St Louis Park, MN 55416<br><div
class="phone">(952) 922-9640</div><a onclick="ga('send', 'event',
'Locator', 'Clicked Location Website Link', 'http://www.seconddebut.org');"
class="website" href="http://www.seconddebut.org">Visit Website</a></div>
"""
soup = BeautifulSoup(markup, "html.parser")
store = soup.find("div", attrs="class": "contact")
print(list(store.strings)[:2])
Result:
['4300 W 36 1/2 St', 'St Louis Park, MN 55416']
answered Mar 7 at 18:38
codycody
8,19631226
8,19631226
add a comment |
add a comment |
Thanks for contributing an answer to Stack Overflow!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function ()
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fstackoverflow.com%2fquestions%2f55049229%2fdiv-text-not-showing-up-with-selenium-python%23new-answer', 'question_page');
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function ()
StackExchange.helpers.onClickDraftSave('#login-link');
);
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Is this from goodwilleasterseals.org/shop/find-store? The HTML there doesn't look like this. Since it seems to be a public URL, can you post it?
– JeffC
Mar 7 at 17:30
Jeff, I've been using this url goodwill.org/locator but it looks like there are the same locations on the url that you've prodived as that one that I've been using. I'm going to try yours
– jnord
Mar 7 at 18:12
I still don't see the HTML that you posted. Can you edit your question and put in the step by step instructions of how you would manually get to the results page you are trying to scrape?
– JeffC
Mar 8 at 1:19