[Zhihu Column] Garbled Chinese text in Zhihu columns again #16210

Open
1 task done
ghost opened this issue Jul 20, 2024 · 17 comments
Labels
RSS bug Something isn't working

Comments

@ghost

ghost commented Jul 20, 2024

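Route

/zhihu/zhuanlan/:id

Full route

/zhihu/zhuanlan/:id

Related documentation

https://docs.rsshub.app/routes/social-media#%E4%B8%93%E6%A0%8F

What is expected?

Readable Chinese text

What actually happened?

Garbled Chinese text

Deployment

RSSHub demo (https://rsshub.app)

Deployment information

No response

Additional information

See #15913 and #15944.

This is not a duplicate issue

  • I have searched the existing issues to make sure this bug has not already been reported.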
@ghost ghost added the RSS bug Something isn't working label Jul 20, 2024
Contributor

Searching for maintainers:

To maintainers: if you are not willing to be disturbed, list your username in scripts/workflow/test-issue/call-maintainer.js. In this way, your username will be wrapped in an inline code block when tagged so you will not be notified.

If no route can be matched, the issue will be closed automatically. For an issue unrelated to any route, please use the NOROUTE keyword, or leave a comment if this is a mistake, and we will review it again.


@ghost ghost changed the title from "[Zhihu Column] Garbled Chinese text when fetching Zhihu columns" to "[Zhihu Column] Garbled Chinese text in Zhihu columns again" Jul 20, 2024


@dzx-dzx
Contributor

dzx-dzx commented Jul 28, 2024

@felixjichao

https://github.com/GuanFoxyier/allMycodes/blob/2df1ff8208e2e41b3b90768daf251c425a793503/zhihu/zhihu_webpack/__zse_ck.js

Locally this generates the random value correctly, but requests to Zhihu still return garbled text.

```bash
curl 'https://zhuanlan.zhihu.com/p/597586623' \
  -H 'accept: text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.7' \
  -H 'accept-language: zh-CN,zh;q=0.9' \
  -H 'cache-control: no-cache' \
  -H 'cookie: __zse_ck=001_F3X=MvcD8wL3cBVPN0d8zj7GVUIqgsyEzahavGOSNkTdDIlitFZqG6ziwz/Pc1GJF6jO17VVASbHqcBsFcZtNtJYBbCPbYG7G4Fb/uiFCg4MI68jP88gxkTBLBaqPMNK' \
  -H 'pragma: no-cache' \
  -H 'priority: u=0, i' \
  -H 'referer: https://zhuanlan.zhihu.com/p/597586623' \
  -H 'sec-ch-ua: "Not)A;Brand";v="99", "Microsoft Edge";v="127", "Chromium";v="127"' \
  -H 'sec-ch-ua-mobile: ?0' \
  -H 'sec-ch-ua-platform: "macOS"' \
  -H 'sec-fetch-dest: document' \
  -H 'sec-fetch-mode: navigate' \
  -H 'sec-fetch-site: same-origin' \
  -H 'upgrade-insecure-requests: 1' \
  -H 'user-agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/126.0.0.0 Safari/537.36'
```

```html
<!doctype html>

<title data-rh="true">练昌AGI渴颜:睁屋恤辫幔栈(LLM)映酬芳集 - 知乎</title>
```

@dzx-dzx
Contributor

dzx-dzx commented Aug 7, 2024

This is really mysterious...
@felixjichao The User-Agent may be part of the cookie computation, so it should probably match the one used in the code.
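
One thing that can be checked mechanically is whether the browser-identifying headers of a request agree with each other. In the curl command above, `sec-ch-ua` advertises Chromium 127 while `user-agent` says Chrome/126. Whether Zhihu's check is sensitive to this is an assumption and is not confirmed anywhere in this thread. The helper names below are hypothetical, and this is only a rough sketch of such a consistency check:

```python
import re
from typing import Mapping, Optional


def chromium_major_from_ua(user_agent: str) -> Optional[int]:
    """Extract the Chrome major version from a User-Agent string, if any."""
    match = re.search(r"Chrome/(\d+)", user_agent)
    return int(match.group(1)) if match else None


def chromium_major_from_ch(sec_ch_ua: str) -> Optional[int]:
    """Extract the Chromium major version from a sec-ch-ua header, if any."""
    match = re.search(r'"Chromium";v="(\d+)"', sec_ch_ua)
    return int(match.group(1)) if match else None


def headers_consistent(headers: Mapping[str, str]) -> bool:
    """Return False only when both headers are present and their versions differ."""
    ua_version = chromium_major_from_ua(headers.get("user-agent", ""))
    ch_version = chromium_major_from_ch(headers.get("sec-ch-ua", ""))
    if ua_version is None or ch_version is None:
        return True  # nothing to compare
    return ua_version == ch_version
```

Passing the `user-agent` and `sec-ch-ua` values from the curl command to `headers_consistent` flags the mismatch, so it may be worth aligning both with whatever User-Agent the cookie script assumes before retrying.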

@klaus2015

Even with the User-Agent matching the one in the JS code, the response still comes back garbled.

@ghost
Author

ghost commented Oct 25, 2024

The accessibility version of Zhihu (关怀版) still seems to work normally. Would anyone be willing to try it?

@WindowsXp-Beta

It seems this is not limited to columns: answers and posts also return garbled text.

@WindowsXp-Beta

Update: the xhu links return garbled text, while the original links do not.

@ghost
Author

ghost commented Nov 3, 2024

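It seems the garbled text appears once a cookie is added.

Would it be possible to detect garbled text (an unusually high count of low-frequency Chinese characters) and automatically fall back to the normal version?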
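
For anyone who wants to experiment with this idea, here is a minimal sketch of such a heuristic. It assumes that readable Chinese text contains a fair share of very common characters, while the garbled output (like the `<title>` posted above) contains almost none. The character sample, the threshold, and all function names are illustrative assumptions and are not part of RSSHub:

```python
import re
from typing import Callable

# Assumption: a small hand-picked sample of very common Chinese characters.
# A real implementation would load a proper frequency list instead.
COMMON_HANZI = set(
    "的一是不了人我在有他这为之大来以个中上们到说国和地也子时道出而要于就下得可你年生自会"
    "那后能对着事其里所去行过家用发天如然作方成者多都小无同么经法当起与好看学进种将还分此"
    "心前面又定见只主没公从"
)

# Matches characters in the CJK Unified Ideographs block.
HANZI = re.compile(r"[\u4e00-\u9fff]")


def common_ratio(text: str) -> float:
    """Fraction of the Chinese characters in `text` that are in COMMON_HANZI."""
    chars = HANZI.findall(text)
    if not chars:
        return 1.0  # no Chinese characters, nothing to judge
    return sum(ch in COMMON_HANZI for ch in chars) / len(chars)


def looks_garbled(text: str, threshold: float = 0.1, min_chars: int = 10) -> bool:
    """Heuristic: flag text whose Chinese characters are almost all uncommon."""
    if len(HANZI.findall(text)) < min_chars:
        return False  # too short to judge reliably
    return common_ratio(text) < threshold


def fetch_with_fallback(primary: Callable[[], str], fallback: Callable[[], str]) -> str:
    """Return the primary result unless it looks garbled, else use the fallback."""
    text = primary()
    return fallback() if looks_garbled(text) else text
```

Here `primary` and `fallback` stand in for two ways of fetching the same content, for example with and without the cookie. The threshold of 0.1 is a guess and would need tuning on real pages, and short titles with unusual names could still be misjudged.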

@notlsd

notlsd commented Nov 6, 2024

Update: activities are not garbled, but answers are. Very strange.

@KwToPA

KwToPA commented Dec 7, 2024

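It seems the garbled text appears once a cookie is added.

Would it be possible to detect garbled text (an unusually high count of low-frequency Chinese characters) and automatically fall back to the normal version?

@WindowsXp-Beta

I asked customer support. The reply was: "The system has detected abnormal usage on your account." A first offense means 24 hours of garbled text, and if abnormal usage is detected again, the duration increases.

This is done on Zhihu's server side. I get garbled text in the web version too, and it started after I added cookies.

Too much scraping got traced back to the account.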

@KwToPA

KwToPA commented Dec 7, 2024

#17581 (comment)

https://rsshub.app/zhihu/xhu/people/activities/246e6cf44e94cefbf4b959cb5042bc91

Currently the /zhihu/xhu route on the official RSSHub instance works normally, but my self-hosted instance returns:

Error Message:
FetchError: [GET] "https://api.zhihuvvv.workers.dev/guests/token": 401 Unauthorized

@YukiSou

YukiSou commented Jan 15, 2025

Helpful Information

Error Message:
FetchError: [GET] "https://api.zhihuvvv.workers.dev/people/246e6cf44e94cefbf4b959cb5042bc91/activities?before_id=0&limit=20": 403 Forbidden

Route: /zhihu/xhu/people/activities/:hexId

Full Route: /zhihu/xhu/people/activities/246e6cf44e94cefbf4b959cb5042bc91

Node Version: v22.13.0

Git Hash: 69f7d21

Git Date: Wed, 15 Jan 2025 06:29:34 GMT

The route on the official instance now returns 403 as well.

@KwToPA

KwToPA commented Jan 18, 2025

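@YukiSou The latest situation: the web version no longer shows garbled text, but it keeps asking me to click through a verification, and I cannot browse otherwise. The RSS routes are occasionally blocked entirely and then fully unblocked within a day. A cookie containing z_c0 is required.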
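
A quick way to confirm that a configured cookie string actually carries `z_c0` before blaming the route is sketched below. The function names are hypothetical, and the parser deliberately splits only on the first `=` so that values containing `=` or `/` (like the `__zse_ck` value in the curl command earlier in this thread) stay intact:

```python
from typing import Dict, Iterable, List


def parse_cookie_header(cookie_header: str) -> Dict[str, str]:
    """Split a Cookie header value into a name -> value mapping."""
    cookies: Dict[str, str] = {}
    for part in cookie_header.split(";"):
        # Split on the first "=" only; cookie values may contain "=" themselves.
        name, sep, value = part.strip().partition("=")
        if sep and name:
            cookies[name] = value
    return cookies


def missing_cookies(cookie_header: str, required: Iterable[str] = ("z_c0",)) -> List[str]:
    """Return the required cookie names that are absent or empty."""
    present = parse_cookie_header(cookie_header)
    return [name for name in required if not present.get(name)]
```

For the cookie in that curl command, which only sets `__zse_ck`, `missing_cookies` reports `z_c0` as missing.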

7 participants