parameter count vs training tokens, planning
Tradeoff in resource allocation.
3,145 technical terms and definitions
Tradeoff in resource allocation.
Total number of trainable weights in the model.
Parameter sharing applies same weights to multiple operations enabling efficient architectures with fewer parameters.
Activations with learnable parameters.
Pareto-optimal neural architecture search discovers architectures optimal for multiple objectives like accuracy and latency.
Tight control of Lipschitz constant.
Parti generates images through autoregressive sequence modeling of visual tokens.
Source has extra classes.
Particle filter performs sequential Monte Carlo inference in non-linear non-Gaussian state space models using weighted particle representations.
Particulate abatement removes solid particles from exhaust using filters or electrostatic precipitators.
Discriminate on patches.
Patch Time Series Transformer divides series into patches reducing tokens and improving long-horizon forecasting efficiency.
Analyze patent documents with NLP.
Analyze patent documents.
Categorize patents.
Help write patent applications.
Find similar patents.
Path encoding represents architectures through enumeration of computational paths from input to output.
Intervene on specific paths.
Analyze tissue slides.
Assess patient risk levels.
Model PBTI degradation.
PC algorithm learns directed acyclic graphs from data through conditional independence tests.
Partially Connected DARTS reduces memory costs by searching over subsets of edges rather than entire architecture graph.
PCMCI+ extends PCMCI with improved conditional independence testing for high-dimensional time series.
PCMCI combines PC algorithm with momentary conditional independence testing for causal discovery in multivariate time series.
Pruned Exact Linear Time algorithm efficiently detects multiple change points in univariate time series.
Different scales for each channel.
Single scale for entire tensor.
Generalized Perceiver for various input/output modalities.
Cross-attention to latent array for processing arbitrary inputs.
Compress to perceptually meaningful space.
Loss based on feature similarity.
Performance prediction estimates architecture accuracy from features or partial training reducing search costs.
Analyze code to find bottlenecks.
Performer approximates attention using random feature maps for linear complexity.
Fast attention using kernel methods.
Predict membrane permeability.
Permutation invariant training resolves label ambiguity in multi-speaker separation through minimum assignment loss.
Generate with specific persona.
Customize therapy for each patient.
Google's toxicity detection.
Google's API for detecting toxic comments.
Perfluorocompound abatement reduces greenhouse gas emissions from plasma etching and CVD through combustion or catalytic destruction.
PFC destruction efficiency measures percentage of perfluorocompounds converted by abatement systems.
Iterative attack projecting onto constraint set.
Identify key features for biological activity.
Sudden capability emergence.
Abrupt changes in model behavior.
Phenaki generates variable-length videos from open-domain textual prompts.