roofline model, optimization
Performance model relating bandwidth and compute.
3,145 technical terms and definitions
Performance model relating bandwidth and compute.
Encode relative positions through rotation.
RotatE models relations as rotations in complex space enabling composition and inversion properties.
Rotational embeddings in complex space.
Rough-cut capacity planning validates feasibility of master production schedule at aggregate level.
Route inputs to specialized subnetworks.
Router z-loss stabilizes expert routing preventing domination.
Route tokens to reduce attention cost.
Random negative slope during training.
Fast heating for short high-temperature treatments.
Derive interpretable rules from trained models.
Run-around loops circulate fluid between exhaust and supply coils transferring thermal energy.
Operate until breakdown.
Ruptures provides algorithms for offline change point detection including binary segmentation and dynamic programming.
Recurrent Variational Autoencoder models sequences through hierarchical latent variables with temporal structure.
Receptance Weighted Key Value combines RNN efficiency with transformer expressiveness.
RNN-like architecture competitive with Transformers.
Efficient SSM using special parameterization.
Structured State Space model uses diagonal approximations for efficient training.
Simplified diagonal state space model improves training stability and efficiency.
Safety classifiers predict whether content violates policy guidelines.
Safety fine-tuning adjusts model parameters to reduce harmful outputs.
Systems preventing harmful outputs.
Safety stock is buffer inventory maintained to protect against demand variability and supply disruptions ensuring production continuity.
Safety training teaches models to decline harmful requests and follow guidelines.
Safety = preventing harmful, illegal, or sensitive outputs. Use policies, classifiers, rule-based filters, and human review for high-risk use cases.
Self-Attention Graph Pooling selects important nodes based on learned attention scores enabling differentiable coarsening for graph classification.
Use attention for pooling.
Highlight which input tokens most influence the output.
Universal segmentation model.
Sandwich rule trains largest and smallest subnetworks alternately plus random architectures for better supernet training.
Mix standard and efficient layers.
SAP provides ERP solutions for semiconductor manufacturing including production planning quality management and supply chain integration.
Seasonal ARIMA extends ARIMA by incorporating seasonal differencing and seasonal AR/MA terms for periodic patterns.
SavedModel is TensorFlow's universal serialization format including computation graph and metadata.
Scalable oversight develops methods for humans to supervise superhuman AI systems.
Scale AI provides enterprise data labeling. Nucleus for data curation.
Theory that simply making models larger leads to better performance.
Relationships between model size data size compute and performance.
Scan chain stitching connects scan cells into shift register chains.
Scan chains convert sequential elements into shift registers enabling serial access to internal states for controllability and observability.
串联 flip-flops for testing internal logic.
Ultrasonic imaging for defects.
Planned downtime for PM.
Schema validation verifies generated structured data matches specifications.
Continuous-filter equivariant network for molecules.
SchNet uses continuous-filter convolutions on interatomic distances with rotationally invariant features for quantum chemistry predictions.
Science-based targets align emissions reductions with climate science to limit global warming.
ML for scientific computing.
Entailment from science questions.