grid search,model training
Try all combinations of hyperparameter values.
355 technical terms and definitions
Try all combinations of hyperparameter values.
Collection of thread blocks.
Mix using grid patterns.
Sudden generalization long after overfitting.
Sudden jump in generalization long after training loss plateaus.
Groq and Cerebras build custom AI chips with massive parallelism. Very fast inference for supported models.
Gross die count includes all die on wafer regardless of functionality or testing.
Gross margin is revenue minus cost of goods sold representing manufacturing profitability.
Revenue minus cost of goods sold as percentage.
Ground bounce is supply voltage noise caused by simultaneous switching of multiple outputs inducing current spikes through package inductance.
Voltage fluctuation on ground due to switching.
Generate from provided sources.
Learn language through interaction with environment.
Common ESD protection device.
Groundedness ensures generated content is supported by provided sources.
Electrical grounding to prevent ESD.
Open-set detection using language.
Connect to external information.
Grounding provides electrical path to earth dissipating static charge.
Grounding connects model output to verified sources. RAG, knowledge graphs, and retrieval help keep responses factual.
Ensure AI outputs are based on retrieved facts rather than hallucinated.
Convolutions over symmetry groups.
Group recommendations suggest items satisfying preferences of multiple users simultaneously.
Group split keeps related samples together. Prevents data leakage.
Process channels in groups.
Grouped convolutions partition input channels into groups processing each independently reducing parameters.
GQA shares key-value heads across query heads. Reduces KV cache size. Llama 2 70B uses GQA.
Middle ground between MQA and multi-head attention.
Share K/V within groups.
Normalize within groups of channels.
Quantum search algorithm.
Drive AI adoption: onboarding, education, quick wins. Measure engagement and retention. Iterate on UX.
gRPC is high-performance RPC framework. HTTP/2, protobuf. Streaming support.
GRU4Rec applies gated recurrent units to session-based recommendation by modeling sequential user behavior within sessions.
Grounded compositional generalization.
Math word problems.
Grade School Math 8K tests mathematical reasoning on word problems.
Grade school math word problems.
Graph Transformer Networks learn new graph structures through soft edge selection for heterogeneous graphs.
Guanaco is QLoRA fine-tuned model. Efficient training on single GPU.
Safety margin in specifications.
Measure latchup protection.
Structures to prevent latch-up.
Guardbanding tightens test limits beyond specifications to reduce test escapes and improve shipped quality.
Add margin to account for variation.
Framework for adding structure validation and safety to LLM outputs.
Guardrails constrain model behavior preventing specific undesired outputs.
Guardrails prevent unwanted model behavior. Topic restrictions, format requirements, safety filters.
Strength of conditioning guidance.
Guidance scale controls trade-off between prompt adherence and sample diversity in guided generation.