Skip to content
Projects
Groups
Snippets
Help
This project
Loading...
Sign in / Register
Toggle navigation
S
spider
Overview
Overview
Details
Activity
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
selection-new
spider
Commits
cce73649
Commit
cce73649
authored
Jan 21, 2026
by
Peng
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
增加清除浏览器缓存
parent
d4dde086
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
13 additions
and
12 deletions
+13
-12
H10_spider.py
py_spider/amazon_spider/H10_spider.py
+13
-12
No files found.
py_spider/amazon_spider/H10_spider.py
View file @
cce73649
...
...
@@ -439,7 +439,7 @@ class H10():
return
False
def
webdrvier_html
(
self
,
asin
,
asinstype
):
# 点击选择站点
for
i
in
range
(
5
):
for
i
in
range
(
6
):
try
:
_url
=
self
.
driver
.
current_url
if
"concurrent-sessions"
in
_url
or
'signin'
in
_url
:
...
...
@@ -447,7 +447,8 @@ class H10():
if
asin
not
in
self
.
err_asin_list
and
self
.
useremail_state
:
print
(
'cerebro界面'
,
self
.
site_name_url
)
self
.
driver
.
get
(
f
'https://members.helium10.com/cerebro?accountId={self.account_id}'
)
if
not
self
.
wait_page
(
timeout
=
15
):
if
not
self
.
wait_page
(
timeout
=
50
):
self
.
driver
.
refresh
()
print
(
'页面未加载出来'
)
continue
time
.
sleep
(
2
)
...
...
@@ -622,7 +623,7 @@ class H10():
# 点击添加
self
.
driver
.
execute_script
(
"""document.querySelector("button[data-testid='applyfilters']").click()"""
)
time
.
sleep
(
3
)
time
.
sleep
(
5
)
# 下载报告
# 点击下载csv按钮
self
.
driver
.
execute_script
(
...
...
@@ -1109,7 +1110,10 @@ class H10():
self
.
driver
.
execute_script
(
"caches.keys().then(function(names) { for (let name of names) { caches.delete(name); } });"
)
self
.
driver
.
execute_script
(
"window.performance.clearResourceTimings();"
)
time
.
sleep
(
2
)
self
.
driver
.
execute_cdp_cmd
(
"Network.clearBrowserCache"
,
{})
# 2) (可选)清性能 timings 不影响登录
self
.
driver
.
execute_script
(
"window.performance.clearResourceTimings();"
)
time
.
sleep
(
5
)
login_url
=
self
.
driver
.
current_url
if
"concurrent-sessions"
in
login_url
or
'signin'
in
login_url
:
self
.
longin
()
...
...
@@ -1172,15 +1176,12 @@ class H10():
self
.
mysql_connect
(
site
)
time
.
sleep
(
randint
(
20
,
50
))
new_date
=
datetime
.
datetime
.
now
()
.
strftime
(
"
%
H"
)
print
(
new_date
)
for
i
in
range
(
10
):
if
new_date
==
'22'
:
time
.
sleep
(
3800
)
new_date
=
datetime
.
datetime
.
now
()
.
strftime
(
"
%
H"
)
print
(
new_date
)
if
new_date
==
'08'
:
self
.
driver
.
refresh
()
print
(
f
"当前时间 {datetime.datetime.now().hour} 点,超出运行时段退出循环。"
)
hour
=
datetime
.
datetime
.
now
()
.
hour
if
hour
<=
22
or
hour
>
8
:
break
time
.
sleep
(
3800
)
if
__name__
==
'__main__'
:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment