TechCrunch AI • 1일 전

AI 용어만 겉도는 당신을 위한 완벽 정리

IMP

5/10

핵심 요약

쏟아지는 AI 관련 용어들로 인해 혼란을 겪는 독자들을 위해, 테크크런치가 주요 AI 용어들의 명확한 정의를 정리한 가이드를 발표했습니다. 이 글은 인공지능, AI 에이전트, API 엔드포인트 등 실무와 개발 현장에서 필수적으로 이해해야 하는 개념들을 쉽게 풀어 설명하고 있습니다. 급변하는 AI 산업 생태계를 파악하고 기술 트렌드를 읽기 위해 매우 유용한 기초 자료입니다.

번역된 본문

인공지능은 세상을 변화시키고 있으며, 동시에 그 변화를 설명하기 위한 완전히 새로운 언어를 만들어내고 있습니다. AI에 관해 5분만 읽어보아도 LLM, RAG, RLHF 등 기술 업계의 똑똑한 전문가들조차 자신감이 떨어지게 만드는 수많은 용어들을 마주하게 됩니다. 이 용어집은 그러한 혼란을 해소하기 위해 마련되었습니다. 우리는 이 분야가 발전함에 따라 이 문서를 정기적으로 업데이트하므로, 이 문서가 설명하는 AI 시스템처럼 계속 살아 숨 쉬는 문서로 생각해 주시기 바랍니다.

AGI 범인공지능(AGI, Artificial general intelligence)은 다소 모호한 용어입니다. 하지만 일반적으로 대부분의 작업에서 평범한 인간보다 더 뛰어난 능력을 갖춘 AI를 의미합니다. OpenAI의 샘 알트만 CEO는 한때 AGI를 "동료로 고용할 수 있는 평균적인 인간과 같은 수준"이라고 설명했습니다. 반면 OpenAI의 헌장에서는 AGI를 "경제적으로 가치 있는 대부분의 작업에서 인간을 능가하는 고도로 자율적인 시스템"으로 정의하고 있습니다. 구글 딥마인드(Google DeepMind)의 정의는 이 두 가지와 약간 다릅니다. 이 연구소는 AGI를 "대부분의 인지적 작업에서 최소한 인간만큼 유능한 AI"로 간주합니다. 혼란스럽나요? 걱정하지 마세요. AI 연구 최전선에 있는 전문가들 역시 마찬가지이니까요.

AI 에이전트 (AI agent) AI 에이전트는 기존의 기본적인 AI 챗봇이 할 수 있는 것을 넘어, 경비 처리, 항공권이나 식당 예약, 심지어 코드 작성 및 유지보수 등 사용자를 대신해 일련의 작업을 수행하는 AI 기술 기반 도구를 뜻합니다. 하지만 앞서 설명한 것처럼, 이 새롭게 떠오르는 분야에는 변화하는 요소들이 많아 'AI 에이전트'라는 단어가 사람마다 다른 의미로 쓰일 수 있습니다. 또한 기대했던 기능들을 온전히 제공하기 위한 인프라 역시 아직 구축 중입니다. 하지만 기본적인 개념은 여러 단계의 작업을 수행하기 위해 여러 AI 시스템을 활용할 수 있는 자율적인 시스템을 의미합니다.

API 엔드포인트 (API endpoints) API 엔드포인트는 소프트웨어 뒷면에 있는 '버튼'과 같아서, 다른 프로그램이 그 버튼을 눌러 소프트웨어가 특정 작업을 수행하게 만들 수 있다고 생각하면 됩니다. 개발자들은 이러한 인터페이스를 사용해 연동 기능을 구축합니다. 예를 들어, 한 애플리케이션이 다른 애플리케이션의 데이터를 끌어오거나, 사람이 직접 조작할 필요 없이 AI 에이전트가 제3자 서비스를 직접 제어할 수 있도록 하는 방식입니다. 대부분의 스마트 홈 기기와 연결된 플랫폼에는 일반 사용자가 보거나 상호작용할 일은 없어도 이러한 숨겨진 버튼들이 존재합니다. AI 에이전트가 더욱 발전함에 따라, 에이전트가 스스로 이러한 엔드포인트를 찾고 활용할 수 있게 되었고 이는 강력하면서도 때로는 예상치 못한 자동화의 가능성을 열어주고 있습니다.

사고의 연쇄 (Chain of thought) 간단한 질문이 주어지면 인간의 뇌는 깊게 생각하지 않고도 즉시 대답할 수 있습니다. 예를 들어 "기린과 고양이 중 어떤 동물이 더 키가 큰가?" 같은 질문입니다. 하지만 많은 경우 중간 단계가 필요하기 때문에 펜과 종이를 꺼내 정답을 도출해야 합니다. 예를 들어, 한 농부에게 닭과 소가 있고, 머리는 40개, 다리는 120개라고 할 때 정답(닭 20마리, 소 20마리)을 계산하기 위해 간단한 방정식을 적어야 할 것입니다. AI 분야에서 대규모 언어 모델(LLM)의 '사고의 연쇄(Chain-of-thought)' 추론은 문제를 더 작은 중간 단계로 나누어 최종 결과의 품질을 향상시키는 것을 의미합니다. 정답을 얻는 데 일반적으로 시간이 조금 더 걸리지만, 특히 논리나 코딩 분야에서 정답을 맞출 확률이 훨씬 높아집니다. 추론 모델은 기존 대규모 언어 모델에서 파생되었으며, 강화 학습 덕분에 이러한 사고의 연쇄 방식에 최적화되었습니다. (참조: 대규모 언어 모델)

코딩 에이전트 (Coding agents) 이것은 프로그램이 목표를 달성하기 위해 단계별로 스스로 행동할 수 있다는 'AI 에이전트'의 더 구체적인 개념입니다. 코딩 에이전트는 소프트웨어 개발에 적용되는 특화된 버전을 말합니다. 인간이 검토하고 복사해 붙여넣을 수 있도록 코드를 단순히 추천하는 수준을 넘어, 코딩 에이전트는 코드를 자율적으로 작성하고 테스트하며 디버깅까지 진행합니다. 개발자의 하루를 차지하는 반복적이고 시행착오가 필요한 작업들을 알아서 처리해 주는 것입니다. 이러한 에이전트는 전체 코드베이스를 살펴보며 버그를 찾고, 테스트를 실행하고, 최소한의 인간 개입만으로 수정된 코드를 적용할 수 있습니다. 이것은 마치 고도로 발전한...

원문 보기

원문 보기 (영어)

Artificial intelligence is changing the world, and simultaneously inventing a whole new language to describe how it's doing it. Spend five minutes reading about AI and you'll run into LLMs, RAG, RLHF, and a dozen other terms that can make even very smart people in the tech world feel insecure. This glossary is our attempt to fix that. We update it regularly as the field evolves, so consider it a living document, much like the AI systems it describes. AGI Artificial general intelligence, or AGI, is a nebulous term. But it generally refers to AI that’s more capable than the average human at many, if not most, tasks. OpenAI CEO Sam Altman once described AGI as the “equivalent of a median human that you could hire as a co-worker .” Meanwhile, OpenAI’s charter defines AGI as “highly autonomous systems that outperform humans at most economically valuable work.” Google DeepMind’s understanding differs slightly from these two definitions; the lab views AGI as “AI that's at least as capable as humans at most cognitive tasks.” Confused? Not to worry — so are experts at the forefront of AI research . AI agent An AI agent refers to a tool that uses AI technologies to perform a series of tasks on your behalf — beyond what a more basic AI chatbot could do — such as filing expenses, booking tickets or a table at a restaurant, or even writing and maintaining code. However, as we’ve explained before , there are lots of moving pieces in this emergent space, so "AI agent" might mean different things to different people. Infrastructure is also still being built out to deliver on its envisaged capabilities. But the basic concept implies an autonomous system that may draw on multiple AI systems to carry out multistep tasks. API endpoints Think of API endpoints as "buttons" on the back of a piece of software that other programs can press to make it do things. Developers use these interfaces to build integrations — for example, allowing one application to pull data from another, or enabling an AI agent to control third-party services directly without a human manually operating each interface. Most smart home devices and connected platforms have these hidden buttons available, even if ordinary users never see or interact with them. As AI agents grow more capable, they are increasingly able to find and use these endpoints on their own, opening up powerful — and sometimes unexpected — possibilities for automation. Chain of thought Given a simple question, a human brain can answer without even thinking too much about it — things like "which animal is taller, a giraffe or a cat?" But in many cases, you often need a pen and paper to come up with the right answer because there are intermediary steps. For instance, if a farmer has chickens and cows, and together they have 40 heads and 120 legs, you might need to write down a simple equation to come up with the answer (20 chickens and 20 cows). In an AI context, chain-of-thought reasoning for large language models means breaking down a problem into smaller, intermediate steps to improve the quality of the end result. It usually takes longer to get an answer, but the answer is more likely to be correct, especially in a logic or coding context. Reasoning models are developed from traditional large language models and optimized for chain-of-thought thinking thanks to reinforcement learning. (See: Large language model ) Coding agents This is a more specific concept that an "AI agent," which means a program that can take actions on its own, step by step, to complete a goal. A coding agent is a specialized version applied to software development. Rather than simply suggesting code for a human to review and paste in, a coding agent can write, test, and debug code autonomously, handling the kind of iterative, trial-and-error work that typically consumes a developer's day. These agents can operate across entire codebases, spotting bugs, running tests, and pushing fixes with minimal human oversight. Think of it like hiring a very fast intern who never sleeps and never loses focus — though, as with any intern, a human still needs to review the work. Compute Although somewhat of a multivalent term, compute generally refers to the vital computational power that allows AI models to operate. This type of processing fuels the AI industry, giving it the ability to train and deploy its powerful models. The term is often a shorthand for the kinds of hardware that provides the computational power — things like GPUs, CPUs, TPUs, and other forms of infrastructure that form the bedrock of the modern AI industry. Deep learning A subset of self-improving machine learning in which AI algorithms are designed with a multi-layered, artificial neural network (ANN) structure. This allows them to make more complex correlations compared to simpler machine learning-based systems, such as linear models or decision trees. The structure of deep learning algorithms draws inspiration from the interconnected pathways of neurons in the human brain. Deep learning AI models are able to identify important characteristics in data themselves, rather than requiring human engineers to define these features. The structure also supports algorithms that can learn from errors and, through a process of repetition and adjustment, improve their own outputs. However, deep learning systems require a lot of data points to yield good results (millions or more). They also typically take longer to train compared to simpler machine learning algorithms — so development costs tend to be higher. (See: Neural network ) Diffusion Diffusion is the tech at the heart of many art-, music-, and text-generating AI models. Inspired by physics, diffusion systems slowly “destroy” the structure of data — for example, photos, songs, and so on — by adding noise until there’s nothing left. In physics, diffusion is spontaneous and irreversible — sugar diffused in coffee can’t be restored to cube form. But diffusion systems in AI aim to learn a sort of “reverse diffusion” process to restore the destroyed data, gaining the ability to recover the data from noise. Distillation Distillation is a technique used to extract knowledge from a large AI model with a ‘teacher-student’ model. Developers send requests to a teacher model and record the outputs. Answers are sometimes compared with a dataset to see how accurate they are. These outputs are then used to train the student model, which is trained to approximate the teacher’s behavior. Distillation can be used to create a smaller, more efficient model based on a larger model with a minimal distillation loss. This is likely how OpenAI developed GPT-4 Turbo, a faster version of GPT-4. While all AI companies use distillation internally, it may have also been used by some AI companies to catch up with frontier models. Distillation from a competitor usually violates the terms of service of AI API and chat assistants. Fine-tuning This refers to the further training of an AI model to optimize performance for a more specific task or area than was previously a focal point of its training — typically by feeding in new, specialized (i.e., task-oriented) data. Many AI startups are taking large language models as a starting point to build a commercial product but are vying to amp up utility for a target sector or task by supplementing earlier training cycles with fine-tuning based on their own domain-specific knowledge and expertise. (See: Large language model [LLM] ) GAN A GAN, or Generative Adversarial Network, is a type of machine learning framework that underpins some important developments in generative AI when it comes to producing realistic data — including (but not only) deepfake tools. GANs involve the use of a pair of neural networks, one of which draws on its training data to generate an output that is passed to the other model to evaluate. The two models are essentially programmed to try to outdo each other. The generator is trying to get its output past the di