Online Linear Programming

Stochastic Games A (discounted) stochastic game with $N$ players consists of the following elements. A state space $\mathcal{X}$. For each player $i$ and state $x$, a set $A_i(x)$ of actions available to player $i$ in state $x$. For each player $i$, state……

Online Linear Programming

Consider a generic LP problem: $$ \begin{aligned} \max \;& \sum_{j=1}^n r_j x_j \\ \text {s.t. } & \sum_{j=1}^n a_{i j} x_j \leq b_i, \quad i=1, \ldots, m \\ & 0 \leq x_j \leq 1, j=1, \ldots, n, \end{aligned} \qquad \Longleftrightarrow \qquad \begin{aligned}……

Dirichlet Process

Dirichlet Process The original definition of the DP is due to Ferguson (1973). Given a measurable space $(\Omega, \mathcal{F})$, a random distribution (measure) $G$ is said to follow a Dirichlet process with a……

Nonparametric learning methods

Robust Dynamic Pricing with Demand Learning in the Presence of Outlier Customers OR, 2022. Articles in Advance. On implications of demand censoring in the newsvendor problem MS 2013 This article, for continuous and discrete distributions,……

Parametric learning methods

Weak aggregating algorithm for the distribution-free perishable inventory problem ORL 2010 In this article, we propose a novel approach to the distribution-free, multi-period problem that utilizes recent advances in the theory of prediction and learning with expert advice. Weak Aggregating Algorithm (WAA)……

Optimal and Approximate Policies in Multiperiod, Multilocation Inventory Models with Transshipments

Published in Operations Research, 1990. DOI: Subject classification: Inventory/production, multi-item/echelon/stage: multilocation models with lateral transshipments. This paper examines appropriate inventory policies when transshipments among multiple outlets are permitted as recourse actions once demands are observed. $x_t \in \mathrm{R}^n$: starting inventory……