recency bias, training phenomena
Recent examples influence more.
3,145 technical terms and definitions
Recent examples influence more.
Combine attention with external memory.
Combine recurrence with attention.
Model dynamics with recurrent and stochastic components.
RNN/LSTM for temporal modeling.
Recursive forecasting uses one-step-ahead model iteratively feeding predictions as inputs for multi-step forecasts.
Supervise reward model with another model.
Recursive reward modeling trains reward models using other reward models hierarchically.
Adversarial testing by human experts.
Adversarial testing to find model vulnerabilities weaknesses or harmful behaviors.
Red-teaming probes models for harmful behaviors informing safety training.
Use reference image to guide.
Reference images guide generation by transferring style or content through adapter networks.
Find objects from descriptions.
Generate descriptions of objects.
Reflection enables agents to critique their own plans and outputs identifying improvements.
Agent learns from feedback and mistakes by generating reflections and improving.
Reformer uses locality-sensitive hashing for approximate attention matching.
Use LSH attention to reduce complexity from quadratic to linear.
Model declining to answer.
Balance safety and helpfulness.
Refusal training explicitly teaches models when to decline requests.
Teach model when to refuse.
Subclass not using inherited methods.
Regenerative thermal oxidizers recover heat through ceramic beds improving energy efficiency.
Regex constraints enforce pattern matching during text generation.
Caption specific image areas.
Goal in online learning to minimize cumulative mistakes.
Reinforcement learning guides graph generation through rewards for desired properties like drug-likeness.
Use RL to search architectures.
Allow model to say "I don't know".
Use relation extraction as pretext task.
Networks explicitly modeling pairwise relations.
Relation-aware aggregation weights messages by edge types when updating node representations.
Transfer relationships between samples.
Learn relative position embeddings.
Optimize maintenance strategy.
Constrain generation using regex patterns.
Renewable energy credits represent environmental attributes of renewable electricity generation.
Renewable energy sources like solar and wind are increasingly adopted by fabs to reduce carbon footprint and energy costs.
Renewal processes model event sequences where inter-arrival times are independent identically distributed random variables.
Rényi differential privacy generalizes epsilon-delta framework using Rényi divergence.
Reorder point is the inventory level triggering replenishment orders calculated from lead time demand and desired service level.
Repetition penalty reduces probability of recently generated tokens.
Training objective in ELECTRA.
Replanning revises strategies when plans fail or conditions change.
Parallel simulations at different temperatures.
Replicate hosts ML models with simple API. Community models. Easy to deploy custom models.
Analyze entire codebases to understand architecture and dependencies.
Compare representations across models.