Rogue Scholar

Published May 13, 2024

Automated Knowledge Graph Construction with Large Language Models — Part 2 Harvesting the Power and Knowledge of Large Language Models Author Amanda Kau ( ORCID : 0009–0004–4949–9284 ) Introduction Knowledge graphs (KGs) are a structured representation of data in a graphical format, in which entities are represented by nodes and are connected by edges representing relationships

OllamaAiTaggingComputer and Information Sciences

How to use Large Language Models to tag your data: A complete tutorial

https://doi.org/10.59350/z1z3k-rrm02

Published May 13, 2024

Author Xuzeng He

Using Mistral for Data tagging Author · Xuzeng He ( ORCID: 0009–0005–7317–7426) Introduction Data tagging, in simple terms, is the process of assigning labels or tags to your data so that they are easier to retrieve or analyse.

MegalodonLong-textsTransformer-architectureComputer and Information Sciences

The longer the context, the better? Unlimited Context Length in Megalodon

https://doi.org/10.59350/dx6a6-yy475

Published May 7, 2024

Author Qingqin Fang

An improvement architecture superior to the Transformer, proposed by Meta Author · Qingqin Fang ( ORCID: 0009–0003–5348–4264) Introduction Recently, researchers from Meta and the University of Southern California have introduced a model called Megalodon. They claim that this model can expand the context window of language models to handle millions of tokens without overwhelming your memory.

Large-language-modelsArtificial-intelligenceTransformersNatural-language-processComputer and Information Sciences

Brief Introduction to the History of Large Language Models (LLMs)

https://doi.org/10.59350/m4c7t-epg97

Published May 7, 2024

Author Wenyi Pi

Understanding the Evolutionary Journey of LLMs Author Wenyi Pi ( ORCID : 0009–0002–2884–2771) Introduction When we talk about large language models (LLMs), we are actually referring to a type of advanced software that can communicate in a human-like manner. These models have the amazing ability to understand complex contexts and generate content that is coherent and has a human feel.

ModularNaiveAdvancedRetrieval-augmented-genComputer and Information Sciences

Three Paradigms of RAG

https://doi.org/10.59350/5j7tt-5y328

Published May 7, 2024

Author Vaibhav Khobragade

From Naive to Modular: Tracing the Evolution of Retrieval-Augmented Generation Author · Vaibhav Khobragade ( ORCID: 0009–0009–8807–5982) Introduction Large Language Models (LLMs) have achieved remarkable success.

Large-language-modelsRlhfFine-tuningComputer and Information Sciences

Fine-tuning Large Language Models: A Brief Introduction

https://doi.org/10.59350/1aezq-kk827

Published May 7, 2024

Author Xuzeng He

Supervised Fine-tuning, Reinforcement Learning from Human Feedback and the latest SteerLM Author · Xuzeng He ( ORCID: 0009–0005–7317–7426) Introduction Large Language Models (LLMs), usually trained with extensive text data, can demonstrate remarkable capabilities in handling various tasks with state-of-the-art performance. However, people nowadays typically want something more personalised instead of a general solution.

Natural-language-processiTransformersArtificial-intelligenceComputer and Information Sciences

Transformers Models in NLP

https://doi.org/10.59350/c7nrg-xay43

Published May 7, 2024

Author Dhruv Gupta

Attention mechanism not getting enough attention Author Dhruv Gupta ( ORCID : 0009–0004–7109–5403) Introduction As discussed in this article, RNNs were incapable of learning long-term dependencies. To solve this issue both LSTMs and GRUs were introduced. However, even though LSTMs and GRUs did a fairly decent job for textual data they did not perform well.

Fake-newsArtificial-intelligenceLarge-language-modelsComputer and Information Sciences

Are Large Language Models Our Allies or Enemies in the Fight Against Fake News?

https://doi.org/10.59350/st0jr-ad818

Published May 7, 2024

Author Amanda Kau

Large Language Models for Fake News Generation and Detection Author Amanda Kau ( ORCID : 0009–0004–4949–9284) Introduction In recent years, fake news has become an increasing concern for many, and for good reason. Newspapers, which we once trusted to deliver credible news through accountable journalists, are vanishing en masse along with their writers.

NaturallanguageprocessingLstmArtificial-intelligenceRecurrent-neural-networkComputer and Information Sciences

RNNs vs GRUs vs LSTMs

https://doi.org/10.59350/t6mga-7zd77

Published April 30, 2024

Author Dhruv Gupta

The Three Oldest Pillars of NLP Author Dhruv Gupta ( ORCID : 0009–0004–7109–5403) Introduction Natural Language Processing (NLP) has almost become synonymous with Large Language Models (LLMs), Generative AI, and fancy chatbots. With the ever-increasing amount of textual data and exponential growth in computational knowledge, these models are improving every day.

Large-language-modelsFrameworkRetrieval-augmentedComputer and Information Sciences

RAG 2.0 is Coming?

https://doi.org/10.59350/6frhg-zxp80

Published April 30, 2024

Author Qingqin Fang

A Unified and Collaborative Framework for LLM Author · Qingqin Fang ( ORCID: 0009–0003–5348–4264) Introduction In today’s rapidly evolving field of artificial intelligence, large language models (LLMs) are demonstrating unprecedented potential. Particularly, the Retrieval-Augmented Generation (RAG) architecture has become a hot topic in AI technology due to its unique technical capabilities.

Stories by Research Graph on Medium

Automated Knowledge Graph Construction with Large Language Models — Part 2

How to use Large Language Models to tag your data: A complete tutorial

The longer the context, the better? Unlimited Context Length in Megalodon

Brief Introduction to the History of Large Language Models (LLMs)

Three Paradigms of RAG

Fine-tuning Large Language Models: A Brief Introduction

Transformers Models in NLP

Are Large Language Models Our Allies or Enemies in the Fight Against Fake News?

RNNs vs GRUs vs LSTMs

RAG 2.0 is Coming?