Home

Online Linear Programming

2023/1/8 Operations Research and Optimization 947

Stochastic Games: A (discounted) stochastic game with $N$ players consists of the following elements. A state space $\mathcal{X}$. For each player $i$ and state $x$, a set $A_i(x)$ of actions available to player $i$ in state $x$. For each player $i$, state……

Online Linear Programming

2023/1/8 Operations Research and Optimization 234

Consider a generic LP problem: $$ \begin{aligned} \max \;& \sum_{j=1}^n r_j x_j \\ \text {s.t. } & \sum_{j=1}^n a_{i j} x_j \leq b_i, \quad i=1, \ldots, m \\ & 0 \leq x_j \leq 1, j=1, \ldots, n, \end{aligned} \qquad \Longleftrightarrow \qquad \begin{aligned}……

Minimax Theorem

2022/11/16 Analysis and Probability 341

Max–min inequality: For a matrix, the maximum of the row minima does not exceed the minimum of the column maxima. Let $f:X \times Y\to \ma……
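The matrix form of the max–min inequality can be checked numerically. The sketch below is only an illustration: the helper names and the `payoff` matrix are made up for this example and do not come from the post.

```python
def max_of_row_minima(matrix):
    """Take the minimum of each row, then the maximum of those minima."""
    return max(min(row) for row in matrix)


def min_of_column_maxima(matrix):
    """Take the maximum of each column, then the minimum of those maxima."""
    return min(max(col) for col in zip(*matrix))


# Hypothetical 3x3 matrix, chosen only for illustration.
payoff = [
    [3, 1, 4],
    [2, 5, 0],
    [6, 2, 3],
]

lower = max_of_row_minima(payoff)     # row minima are 1, 0, 2
upper = min_of_column_maxima(payoff)  # column maxima are 6, 5, 4

# The max–min inequality: max-min never exceeds min-max.
assert lower <= upper
```

Here the two values differ (2 versus 4), so the inequality is strict for this matrix; equality holds exactly when the matrix has a saddle point.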

Dirichlet Process

2022/11/14 Analysis and Probability 218

Dirichlet Process: The original definition of the DP is due to Ferguson (1973). Given a measurable space $(\Omega, \mathcal{F})$, a random distribution (measure) $G$ is said to follow a Dirichlet process with a……

Multi-armed Bandits (3)

2022/11/12 Operations Research and Optimization/MAB 814

Lower Bound: To prove a lower bound, we need to construct some problem instances ($\mathcal{……

Analytics for an Online Retailer: Demand Forecasting and Price Optimization

2022/10/25 Brief Paper Readings 2317

Published in Manufacturing & Service Operations Management, 2016. DOI: https://doi.org/10.1287/msom.2015.0561. Keywords: online retailing; flash sales; initial pricing; revenue management; price optimization; machine learning; regression trees; demand forecasting; demand interdependency; model implementation. This paper falls under 数……

Nonparametric learning methods

2022/10/1 Operations Research and Optimization 3550

Robust Dynamic Pricing with Demand Learning in the Presence of Outlier Customers (OR, 2022, Articles in Advance). On implications of demand censoring in the newsvendor problem (MS, 2013): This paper, for continuous and discrete distributions……

Parametric learning methods

2022/10/1 Operations Research and Optimization 1688

Weak aggregating algorithm for the distribution-free perishable inventory problem (ORL, 2010): In this article, we propose a novel approach to the distribution-free, multi-period problem that utilizes recent advances in the theory of prediction and learning with expert advice. Weak Aggregating Algorithm (WAA)……

Multi-armed Bandits (2)

2022/9/30 Operations Research and Optimization/MAB 321

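Contextual Bandits; Lipschitz Bandits; Continuum-armed bandits: First consider the special case where the arm is a continuous variable (CAB). Without loss of generality, assume the arm set is $X=[0, 1]$, whose mean $\mu(x)$ satisfies……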

Optimal and Approximate Policies in Multiperiod, Multilocation Inventory Models with Transshipments

2022/9/25 Brief Paper Readings 115

Published in Operations Research, 1990. DOI: https://doi.org/10.1287/opre.38.2.278. Subject classification: Inventory/production, multi-item/echelon/stage: multilocation models with lateral transshipments. This paper examines appropriate inventory policies when transshipments among multiple outlets are permitted as recourse actions once demands are observed. $x_t \in \mathrm{R}^n$: starting inventory……