
{"id":2802,"date":"2024-05-23T08:49:21","date_gmt":"2024-05-23T00:49:21","guid":{"rendered":"https:\/\/infernews.com\/archives\/2802"},"modified":"2024-05-23T20:06:44","modified_gmt":"2024-05-23T12:06:44","slug":"scrape-web-with-llama3-ollama","status":"publish","type":"post","link":"https:\/\/infernews.com\/blog\/scrape-web-with-llama3-ollama\/","title":{"rendered":"Web Scraping with llama3 + Ollama + ScrapeGraphAI"},"content":{"rendered":"<figure class=\"wp-block-embed-youtube wp-block-embed is-type-video is-provider-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"lyte-wrapper\" title=\"Scrape Any Website using llama3+Ollama+ScrapeGraphAI | Fully Local + Free #ai #llm\" style=\"width:853px;max-width:100%;margin:5px auto;\"><div class=\"lyMe\" id=\"WYL_2BTI3KIiGHU\" itemprop=\"video\" itemscope itemtype=\"https:\/\/schema.org\/VideoObject\"><div><meta itemprop=\"thumbnailUrl\" content=\"https:\/\/infernews.com\/blog\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2F2BTI3KIiGHU%2Fhqdefault.jpg\" \/><meta itemprop=\"embedURL\" content=\"https:\/\/www.youtube.com\/embed\/2BTI3KIiGHU\" \/><meta itemprop=\"duration\" content=\"PT13M7S\" \/><meta itemprop=\"uploadDate\" content=\"2024-05-14T12:04:27Z\" \/><\/div><div id=\"lyte_2BTI3KIiGHU\" data-src=\"https:\/\/infernews.com\/blog\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2F2BTI3KIiGHU%2Fhqdefault.jpg\" class=\"pL\"><div class=\"tC\"><div class=\"tT\" itemprop=\"name\">Scrape Any Website using llama3+Ollama+ScrapeGraphAI | Fully Local + Free #ai #llm<\/div><\/div><div class=\"play\"><\/div><div class=\"ctrl\"><div class=\"Lctrl\"><\/div><div class=\"Rctrl\"><\/div><\/div><\/div><noscript><a href=\"https:\/\/youtu.be\/2BTI3KIiGHU\" rel=\"nofollow\"><img loading=\"lazy\" decoding=\"async\" 
src=\"https:\/\/infernews.com\/blog\/wp-content\/plugins\/wp-youtube-lyte\/lyteCache.php?origThumbUrl=https%3A%2F%2Fi.ytimg.com%2Fvi%2F2BTI3KIiGHU%2F0.jpg\" alt=\"Scrape Any Website using llama3+Ollama+ScrapeGraphAI | Fully Local + Free #ai #llm\" width=\"853\" height=\"460\" \/><br \/>Watch this video on YouTube<\/a><\/noscript><meta itemprop=\"description\" content=\"In a constantly evolving web landscape, ScrapeGraphAI introduces a new era of web scraping. This open-source library leverages Large Language Models (LLMs) to offer flexible and low-maintenance scraping solutions for developers. ScrapeGraphAI is a web scraping Python library that uses LLM and direct graph logic to create scraping pipelines for websites and local documents (XML, HTML, JSON, etc.). The SmartScraperGraph class represents one of the default scraping pipelines, utilizing a direct graph implementation where each node has its own function\u2014from retrieving HTML from a website to extracting relevant information based on your query and generating a coherent answer. In this video, we explore how to scrape website content using LLaMA3, Ollama, and ScrapeGraphAI, all running locally. Note that a minimum of 15GB of RAM is required for this application. This approach can also be applied to your local documents such as XML, HTML, and more. Let&#039;s dive into it! 
#WebScraping #PythonLibrary #ScrapeGraphAI #LLM #OpenSource #Tutorial #DataExtraction #LocalDocuments Blog :https:\/\/www.dataedgehub.com LINKS: Code :https:\/\/www.dataedgehub.com\/2024\/07\/Webscraping-locallm.html GitHub Code: https:\/\/github.com\/InsightEdge01\/ScrapegraphAIOllamallama3 Ollama download:https:\/\/ollama.com\/download ScrapegraphAI:https:\/\/scrapegraph-ai.readthedocs.io\/en\/latest\/introduction\/overview.html ScrapeGraphAI Github: https:\/\/github.com\/VinciGit00\/Scrapegraph-ai?tab=readme-ov-file https:\/\/www.youtube.com\/watch?v=FiCsuN7aPF8 https:\/\/www.youtube.com\/watch?v=QmTtU-qbjUA&amp;t=187s https:\/\/www.youtube.com\/watch?v=WjoTAzuf1Dg&amp;t=373s\"><\/div><\/div><div class=\"lL\" style=\"max-width:100%;width:853px;margin:5px auto;\"><\/div><figcaption><\/figcaption><\/figure>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"googlesitekit_rrm_CAowvqSiDA:productID":"","footnotes":""},"categories":[27],"tags":[],"class_list":["post-2802","post","type-post","status-publish","format-standard","hentry","category-paper"],"_links":{"self":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/2802","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/comments?post=2802"}],"version-history":[{"count":0,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/posts\/2802\/revisions"}],"wp:attachment":[{"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/media?parent=2802"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/infernews.com\/blog\/w
p-json\/wp\/v2\/categories?post=2802"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/infernews.com\/blog\/wp-json\/wp\/v2\/tags?post=2802"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}