今天David 9 给大家开放一个很棒的数据集, 收集了迄今为止的phpweekly和pycoders上的所有HTML数据, 不可多得的技术引导.
项目地址:
https://github.com/yanchao727/crawl_pycoder
phpweekly 所有数据路径: https://github.com/yanchao727/crawl_pycoder/tree/master/crawl_pycoder/spiders/php_weekly_dump
pycoders 所有数据路径:
https://github.com/yanchao727/crawl_pycoder/tree/master/crawl_pycoder/spiders/content
The following two tabs change content below.
David 9
邮箱:yanchao727@gmail.com
微信: david9ml
Latest posts by David 9 (see all)
- 修订特征已经变得切实可行, “特征矫正工程”是否会成为潮流? - 27 3 月, 2024
- 量子计算系列#2 : 量子机器学习与量子深度学习补充资料,QML,QeML,QaML - 29 2 月, 2024
- “现象意识”#2:用白盒的视角研究意识和大脑,会是什么景象?微意识,主体感,超心智,意识中层理论 - 16 2 月, 2024