refusal training, ai safety
Teach model when to refuse.
145 technical terms and definitions
Teach model when to refuse.
Subclass not using inherited methods.
Regenerative thermal oxidizers recover heat through ceramic beds improving energy efficiency.
Regex constraints enforce pattern matching during text generation.
Caption specific image areas.
Goal in online learning to minimize cumulative mistakes.
Reinforcement learning guides graph generation through rewards for desired properties like drug-likeness.
Use RL to search architectures.
Allow model to say "I don't know".
Use relation extraction as pretext task.
Networks explicitly modeling pairwise relations.
Relation-aware aggregation weights messages by edge types when updating node representations.
Transfer relationships between samples.
Learn relative position embeddings.
Optimize maintenance strategy.
Constrain generation using regex patterns.
Renewable energy credits represent environmental attributes of renewable electricity generation.
Renewable energy sources like solar and wind are increasingly adopted by fabs to reduce carbon footprint and energy costs.
Renewal processes model event sequences where inter-arrival times are independent identically distributed random variables.
Rényi differential privacy generalizes epsilon-delta framework using Rényi divergence.
Reorder point is the inventory level triggering replenishment orders calculated from lead time demand and desired service level.
Repetition penalty reduces probability of recently generated tokens.
Training objective in ELECTRA.
Replanning revises strategies when plans fail or conditions change.
Parallel simulations at different temperatures.
Replicate hosts ML models with simple API. Community models. Easy to deploy custom models.
Analyze entire codebases to understand architecture and dependencies.
Compare representations across models.
Find most representative training examples.
Request queuing manages incoming requests ensuring fair processing order.
Normalization after residual addition.
Track information flow through residuals.
Residual stress analysis measures stress distributions in packages using techniques like Raman spectroscopy or warpage measurement.
Resolution multipliers adjust input image size trading accuracy for speed.
Response quality measures correctness helpfulness and safety of model outputs.
Guidelines for ethical AI development.
Responsible AI encompasses fairness, transparency, safety, privacy. Governance frameworks guide practice.
Responsible AI: fairness, transparency, accountability, safety. Governance ensures ethical development and use.
Retention mechanism computes attention-like aggregation through recursive formulation.
Detect diseases from retinal images.
Retentive Network uses retention mechanism for parallel training and efficient inference.
Retention mechanism for efficient parallel training and recurrent inference.
Use retrieved documents to inform generation.
Architecture designed for retrieval augmentation.
Plan synthetic routes backward from target.
Retry logic attempts failed requests again with exponential backoff.
Reverse osmosis purifies water by forcing it through semipermeable membranes removing dissolved solids.
Reward hacking finds unintended ways to maximize reward without intended behavior.
Reward model scores outputs based on human preferences. Train from comparisons. Used in RLHF.
Reward model scores outputs by human preference. Trained on comparison data. Guides RL fine-tuning.