Meta 2.2: Bases de datos
Introduction to CAT Tools and Terminology Management
Overview of Previous Lessons
- Discussed automatic translation tools and Computer-Assisted Translation (CAT) tools in previous classes.
- Emphasized the importance of various elements within CAT tools, including translation memories and segmentation techniques.
Importance of Terminology Bases
- Introduced the concept of terminology bases as a critical component in the translation process.
- Highlighted that effective documentation enhances translation accuracy, even for non-experts in specific fields like biology or medicine.
Understanding Terminology Management
Definition and Characteristics of Terms
- Defined a term as a specialized unit of knowledge, crucial for precise communication in specific domains such as medicine or chemistry.
- Explained that terms must have clear definitions that vary based on context, emphasizing their role in specialized communication.
Eugene Gustav's Theory of Terminology
- Introduced Eugene Gustav's theory as foundational to understanding terminology management, asserting its independence from general linguistics.
- Stressed that terminology should focus on concepts rather than just words, ensuring clarity and uniformity in technical communication.
Key Principles from Woster’s Theory
Core Concepts
- Normative Approach: Woster viewed terminology as a normative discipline aimed at establishing rules for term usage rather than merely describing existing practices.
- Conceptual Foundation: Emphasized that concepts are central to terminology; terms should be assigned only after defining the underlying concept clearly.
Uniqueness and Normalization
- Advocated for univocity where each concept corresponds to one unique term to avoid confusion caused by synonyms or polysemy.
- Discussed normalization processes aimed at creating consistent rules for term usage to enhance transparency in knowledge sharing and technical translations.
Understanding Terminology Databases
Theoretical Foundations of Terminology
- The general theory of terminology aims to eliminate ambiguities and misunderstandings in specialized languages, particularly in science and technology.
- This theory established terminology as a scientific discipline, focusing on creating rules for the correct use of terms rather than describing human language usage.
- Its primary goal is to generate guidelines that ensure clarity and precision in technical and scientific communication.
What is a Terminology Database?
- A terminology database is a systematic collection of specialized terms formatted for easy access through software.
- Unlike glossaries or dictionaries, these databases are dynamic, structured specifically for quick searches across multiple languages.
- They aim to guarantee coherence, accuracy, and quality in translations by organizing data effectively.
Differences Between Glossaries and Terminology Databases
- While both glossaries and terminology databases contain similar information, glossaries can be created manually without the need for extensive data systems.
- Glossaries serve as terminological tools that enhance coherence in specialized communication but lack the systematic structure of databases.
- Terminology databases can be updated continuously with new information, unlike static glossaries which may not reflect recent changes.
Importance of Terminology Databases in Translation
- Maintaining terminological consistency is crucial for professional quality in translations; it saves time and enhances overall effectiveness.
- These databases allow for interoperability across different translation projects when dealing with similar topics.
Terminology Databases and Their Construction
Overview of Terminology Databases
- The IAT (Interactive Terminology of Europe) is highlighted as a free-access database with 1.4 million multilingual entries, serving the European Union.
- UN Term, the United Nations Terminology Database, contains technical terms in six official languages: Arabic, Chinese, English, Russian, and Spanish.
- The discussion transitions to how terminology databases are constructed.
Steps in Building a Terminology Database
Step 1: Extraction of Terminology
- Two primary methods for extracting terminology are introduced: statistical methods and linguistic methods.
- Statistical methods involve using a corpus fed with relevant text to identify frequently occurring words.
- Linguistic methods focus on identifying content words based on their characteristics to determine if they qualify as terms.
Step 2: Finding Equivalents
- After defining terms, the next step is searching for equivalents in the target language using reliable sources like dictionaries and glossaries.
Step 3: Creating Data Sheets
- Each term must be documented with essential elements such as grammatical category, definition, context usage, and additional information to ensure reliability.
Step 4: Validation Process
- Validation involves reviewing terminology with experts or checking against trusted sources to confirm accuracy.
Step 5: Digital Storage of Database
- Final storage typically uses standardized files like TVX or spreadsheets (Excel), depending on the software used for managing databases.
Essential Elements of a Terminology Database Entry
- Key components include:
- Source language term
- Target language equivalent
- Definition
- Grammatical category
- Contextual usage notes
Tools and Formats for Managing Terminology Databases
- Various programs can be utilized for creating and managing these databases:
- MultiTerm or SDL Trados for professional use.
- Excel or Google Sheets for simpler bases that allow easy exportation.
- Standard formats required for compatibility across different CAT tools include TVX, CSB, or SDLTV.