chunk overlap, rag
Chunk overlap prevents splitting important information across boundaries.
1,005 technical terms and definitions
Chunk overlap prevents splitting important information across boundaries.
Overlap between chunks to avoid splitting important context at boundaries.
Find optimal chunk length.
Tune size of text chunks to balance context and retrieval precision.
Chunk size determines granularity of text segments for retrieval.
Chunked prefill processes long prompts in chunks. Disaggregated prefill separates from decode. Better scheduling.
Chunking splits documents for embedding/retrieval. Use semantic boundaries, 512-1024 tokens, with overlap for continuity.
CI/CD automates test and deploy. GitHub Actions, GitLab CI.
I can outline CI/CD pipelines (build, test, deploy), recommend tools, and show how to automate key checks.
Stop calling failing service to prevent cascade.
Circuit breakers halt requests to failing services allowing recovery time.
Circuit breaker stops calling failing services. Fallback to cached response or simpler model. Graceful degradation.
Find functional circuits in networks.
Modify circuit using FIB for debug.
Circular economy approaches maximize material reuse refurbishment and recycling minimizing waste in semiconductor manufacturing lifecycle.
Whether citations support claims.
Analyze legal citation networks.
Generate proper citations.
Citations reference sources supporting generated information.
Citations link generated text to source documents. Important for trust and fact-checking. Include doc IDs or quotes.
Format citations. APA, MLA, Chicago. Bibliography generation.
Collaborative Knowledge-Aware Attention Network jointly learns from user-item and item-entity graphs for recommendations.
Cocke-Kasami-Younger algorithm performs bottom-up chart parsing for context-free grammars in cubic time complexity.
Contrastive Learning for Sequential Recommendation uses augmentation and InfoNCE loss for representation learning.
Identify factual claims in text.
System for identifying check-worthy claims.
ClariNet combines Gaussian inverse autoregressive flow with WaveNet for parallel fast speech synthesis.
Special token for classification in ViT.
Class weights adjust loss for imbalance. Higher weight for minority.
Weight classes by inverse frequency.
Add new classes over time.
AI planning using STRIPS or PDDL.
Classify dies into performance bins.
Classification predicts categories. Binary or multiclass.
Use classifier gradients to guide generation.
Use classifier to select data.
Guidance without separate classifier.
Classifier-free guidance steers diffusion models using conditional and unconditional score estimates.
Control generation strength by mixing conditional and unconditional predictions.
Classify text into categories. Topic, intent, type.
Anthropic's vision-capable model.
Anthropic's helpful honest and harmless AI assistant.
Identify key clauses in contracts.
Contrastive Learning for Cold-start Recommendation uses self-supervised learning to improve initialization.
Poison with correctly labeled examples.
Cleanlab finds label errors in data. Data-centric AI. Improve training data.
Specifications for contamination levels.
Best practices to minimize contamination.
Rating of particle count per volume (Class 1 10 100 etc).
Cleanroom garments prevent particle shedding from personnel into environment.