Web Scraping di dati da internet attraverso l'Intelligenza Artificiale

Danè, Diego <2001>

dc.contributor.advisor	Anguita, Davide <1963>
dc.contributor.author	Danè, Diego <2001>
dc.date.accessioned	2023-12-21T15:26:13Z
dc.date.available	2023-12-21T15:26:13Z
dc.date.issued	2023-12-19
dc.identifier.uri	https://unire.unige.it/handle/123456789/7269
dc.description.abstract	Il presente elaborato focalizza l’attenzione sull’utilizzo di ChatGPT, un chatbot che basa il suo funzionamento sul GPT-3.5(ovvero un Large Language Model di ultima generazione), per creare uno strumento per il Web Scraping, una tecnica che si basa sull’identificazione e il rilevamento di dati in automatico, rendendo così il processo di estrazione delle informazioni un processo più rapido ed efficiente. In primo luogo la tesi introduce il concetto di Web Scraping e fornisce un quadro generale sulle caratteristiche e le applicazioni della tecnica di estrazione in vari campi lavorativi. Successivamente vengono trattati i Large Language Models, modelli di intelligenza artificiale che sono capaci di utilizzare il linguaggio naturale per comunicare. Vengono analizzate le caratteristiche di questi modelli IA e in particolar modo si focalizza l’attenzione su ChatGPT, l’esempio più importante che ha alla base un LLM. Dopo aver introdotto ChatGPT si analizzano le varie tecniche che vanno sotto il nome di Prompt Engineering, e come esse, se applicate, possono portare ad un utilizzo più efficiente del chatbot in termini dello sviluppo dello strumento di Web Scraping. L’obiettivo della tesi è quindi quello di dimostrare che ChatGPT può rivelarsi uno strumento di supporto valido quando si parla di estrapolazione di dati, attraverso Scraping, permettendo di risparmiare tempo nella programmazione e nello sviluppo dello Scraper e rendendo più efficiente tutto il processo di analisi di dati.	it_IT
dc.description.abstract	The present paper focuses on the use of ChatGPT, a chatbot that operates on GPT-3.5 (a state-of-the-art Large Language Model), to create a tool for Web Scraping. Web Scraping is a technique based on the automatic identification and detection of data, making the information extraction process faster and more efficient. Firstly, the thesis introduces the concept of Web Scraping and provides a general overview of the characteristics and applications of the extraction technique in various professional fields. Subsequently, Large Language Models are discussed, which are artificial intelligence models capable of using natural language for communication. The features of these AI models are analyzed, with a particular focus on ChatGPT, the most significant example based on a Large Language Model. After introducing ChatGPT, various techniques falling under the umbrella term "Prompt Engineering" are examined. It explores how, if applied, these techniques can lead to a more efficient use of the chatbot in terms of developing the Web Scraping tool. The thesis aims to demonstrate that ChatGPT can be a valuable support tool for data extraction through Scraping, saving time in programming and developing the Scraper, and enhancing the efficiency of the entire data analysis process.	en_UK
dc.language.iso	it
dc.rights	info:eu-repo/semantics/restrictedAccess
dc.title	Web Scraping di dati da internet attraverso l'Intelligenza Artificiale	it_IT
dc.title.alternative	Data Web scraping from internet through Artificial Intelligence	en_UK
dc.type	info:eu-repo/semantics/bachelorThesis
dc.subject.miur	ING-INF/05 - SISTEMI DI ELABORAZIONE DELLE INFORMAZIONI
dc.publisher.name	Università degli studi di Genova
dc.date.academicyear	2022/2023
dc.description.corsolaurea	10716 - INGEGNERIA GESTIONALE
dc.description.area	9 - INGEGNERIA
dc.description.department	100025 - DIPARTIMENTO DI INGEGNERIA MECCANICA, ENERGETICA, GESTIONALE E DEI TRASPORTI

Files in this item

Name:: tesi26775516.pdf
Size:: 2.220Mb
Format:: PDF

View/Open

This item appears in the following Collection(s)

Laurea Triennale [4409]

Show simple item record