Timeout error when scraping Capology #46

c10vis · 2024-08-16T22:26:47Z

ScraperFC version: 3.1.0
Selenium version: 4.23.1

As normal, I import ScraperFC, initialize the Capology scraper per the documentation, and attempt to scrape EPL data from 2023-24. The result is a timeout error (see photo below). I have tried various seasons and leagues with similar results. I have been able to scrape from other modules in ScraperFC with no problems. I have also made a Capology account and logged in in-browser; this has not changed my results.

oseymour · 2024-08-20T03:40:05Z

Hey @c10vis I just tried this locally and it worked. No error. Not sure what's going on with yours. Are you still getting the error?

c10vis · 2024-08-20T05:11:18Z

Yeah, still getting the error. I wonder if it has to do with Chrome? I'm not sure how the backend works but seems like the scraper is using a chrome driver that's causing an issue. I don't know if that's something specific to how I'm set up or just generally how it works.

oseymour · 2024-08-22T04:25:28Z

I doubt it's a chrome issue. You're correct, Selenium creates a chromedriver (essentially just a chrome window) and using that avoids a lot of anti-scraping measures vs. just doing an HTTP request with requests. Using a chromedriver also allows for interacting with the webpage (e.g., changing currency on Capology).

I can't do much to debug this without having the issue myself. You could try increasing the timeout duration.

What OS are you using? Is this running on your laptop or a remote machine/server?

c10vis · 2024-08-24T05:10:09Z

I'm running MacOS 14.6.1 on my personal laptop (M1 MacBook Pro).

I timed the issue and it runs for about 1 min 20s before the timeout error hits. How would I go about increasing the timeout duration?

oseymour · 2024-09-07T12:27:52Z

Sorry for the delay. Was moving in with my girlfriend.

What you measured is the time to hit that error (I assume). The timeout duration for finding the element that is failing is 10 seconds. You'll need to go to where python downloads packages when it pip installs them and increase the 10 to something else in the .py file. I don't know where that is on macOS though. Google should be able to tell you. And just follow the error trace you get to see which line needs to be edited.

oseymour · 2024-11-07T04:44:17Z

Hey @c10vis I was finally able to reproduce this issue on my machine and I've got a fix coming! Running tests on it now. I'm travelling this weekend but should be able to get it committed when I get back.

oseymour closed this as completed in f0750cd Nov 12, 2024

oseymour added the bug Something isn't working label Nov 15, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Timeout error when scraping Capology #46

Timeout error when scraping Capology #46

c10vis commented Aug 16, 2024

oseymour commented Aug 20, 2024

c10vis commented Aug 20, 2024

oseymour commented Aug 22, 2024

c10vis commented Aug 24, 2024

oseymour commented Sep 7, 2024

oseymour commented Nov 7, 2024

Timeout error when scraping Capology #46

Timeout error when scraping Capology #46

Comments

c10vis commented Aug 16, 2024

oseymour commented Aug 20, 2024

c10vis commented Aug 20, 2024

oseymour commented Aug 22, 2024

c10vis commented Aug 24, 2024

oseymour commented Sep 7, 2024

oseymour commented Nov 7, 2024