网络设计到的攻击层面_看b站播放温柔乡_ppt软件免费下载安装_设计企业网页公司 = []def write_to_mongo(ips, city):'''将数据写入mongoDB'''client = Client(host='localhost', port=27017)db = client['fs_db']coll = db[city + '_good']for ip in ips:coll.insert_one({'name': ip[0], \'price': ip[1],'addresses': ip[2],'areas': ip[3],'eq': ip[4]})client.close()def read_from_mongo(city):client = Client(host='localhost', port=27017)db = client['fs_db']coll = db[city + '_good']li = coll.find()client.close()return liclass Consumer(threading.Thread):def __init__(self, args):threading.Thread.__init__(self, args=args)def run(self):global is_crawurl_demo, i, city_id, lock = self._argsprint("{}, 第{}页".format(city[city_id], i))url = url_demo.format(i)soup = get_real(url)names = []for name in soup.select('.tit_shop'):names.append(name.text.strip())addresses = []for item in soup.find_all('p', attrs={'class': 'add_shop'}):address = item.a.text + " " + item.span.textaddresses.append(address.replace('\t', '').replace('\n', ''))es = []for item in soup.find_all('p', attrs={'class': 'tel_shop'}):es.append(item.text.replace('\t', '').replace('\n', ''))moneys = []for money in soup.find_all("span", attrs={"class": 'red'}):moneys.append(money.text.strip())areas = []for area in soup.find_all('dd', attrs={'class': 'price_right'}):areas.append(area.find_all('span')[-1].text)houses = []for idx in range(len(names)):try:item = [names[idx], moneys[idx], addresses[idx], areas[idx], es[idx]]print(item)houses.append(item)except Exception as e:print(e)lock.acquire()write_to_mongo(houses, e_city[city_id])lock.release()print("线程结束{}".format(i))def dict2proxy(dic):s = dic['type'] + '://' + dic['ip'] + ':' + str(dic['port'])return {'http': s, 'https': s}def get_real(url):resp = requests.get(url, headers=header)soup = BeautifulSoup(resp.content, 'html.parser', from_encoding='gb18030')if soup.find('title').text.strip() == '跳转...':pattern1 = re.compile(r"var t4='(.*?)';")script = soup.find("script", text=pattern1)t4 = pattern1.search(str(script)).group(1)pattern1 = re.compile(r"var t3='(.*?)';")script = soup.find("script", text=pattern1)t3 = re.findall(pattern1, str(script))[-2]url = t4 + '?' + t3HTML = requests.get(url, headers=header)soup = BeautifulSoup(HTML.content, 'html.parser', from_encoding='gb18030')elif soup.find('title').text.strip() == '访问验证-房天下':passreturn soupdef read_proxies():client = Client(host='localhost', port=27017)db = client['proxies_db']coll = db['proxies']# 先检测,再写入,防止重复dic = list(coll.find())client.close()return dicdef craw():lock = threading.Lock()for idx in trange(len(e_city)):url = eshouse[idx]soup = get_real(url.format(2))try:page_number = int(soup.find('div', attrs={'class': 'page_al'}).find_all('span')[-1].text[1:-1])pages = list(range(1, page_number + 1))except:pages = list(range(1, 101))url_demo = urlts = []# pages = [1, 2, 3]while len(pages) != 0:for i in range(10):t = Consumer((url_demo, pages.pop(), idx, lock))t.start()ts.append(t)if len(pages) == 0:breakfor t in ts:t.join()ts.remove(t)if __name__ == '__main__':craw()">

做网页很难吗

长沙建设外贸网站、2021主流的网页设计软件、永久使用、不限域名、广东网络优化推广价格

彩虹网站资源?

网络设计到的攻击层面_看b站播放温柔乡_ppt软件免费下载安装_设计企业网页公司

HTML的全称为HyperText Markup Language,即超文本标记语言。它是一种用于创建可从一个平台移植到另一平台的超文本文档的简单标记语言,经常用来创建web页面。HTML文本是由HTML命令组成的描述性文本,HTML命令可以说明文字、图形、动画、声音、表格、链接等。

中华建设期刊官网域名怎么注册好广东网站开发建设网站页面设计有哪些板块?物流网站 源码智慧团建网站官网打不开绍兴seo推广公司维骐seo联通网络优化主要干什么网络营销服务合同搭建网站显示404怎么解决前端面试一问三不知怎么办北京海淀做网站公司哪家比较好女孩短期技能培训班怎么把东西放到网上卖上线了在线建站阳江房产登记中心官网专注建站定制网络工程师学习路线香港服务器空间职业生涯规划大赛ppt模板免费长春网站优化如何公司网络建设方案模板公司注册的具体流程网页制作图片加边框wap网页游戏源码北京ui设计师工资一般多少广州有几个区县燃气公司电话网站建设公司有哪些知名的怎么制作微信小程序教程

猜你喜欢

  • 友情链接:
  • 自己的品牌怎么做加盟推广 陕西住建网官网 极速浏览器 流氓软件游戏下载app大全免费下载 微博营销的七种模式 电商购物app排行榜