I am an Associate Professor of Computer Science at Singapore Management University, specializing in the optimization of complex systems at the intersection of engineering and business. I bridge academic research with industrial application, having recently served as a Principal Research Scientist at Amazon (2024–2025) and previously as Deputy Director of Research for the Fujitsu-SMU Urban Computing and Engineering Corp Lab.
My research in urban computing and human decision-making focuses on real-world impact, earning awards from CIKM, AAMAS, and INFORMS. A frequent SPC/PC member for major AI conferences like AAAI and IJCAI, I publish widely across top-tier AI, operations, and transportation venues. I hold a Ph.D. from the University of Michigan and a B.S.E. from National Taiwan University.
We consider a dynamic pricing problem in network revenue management where customer behavior is predicted by a choice model, i.e., the multinomial logit (MNL) model. The problem, even in the static setting (i.e., customer demand remains unchanged over time), is highly non-concave in prices. Existing studies mostly rely on the observation that the objective function is concave in terms of purchasing probabilities, implying that the static pricing problem with linear constraints on purchasing probabilities can be efficiently solved. However, this approach is limited in handling constraints on prices, noting that such constraints could be highly relevant in some real business considerations. To address this limitation, in this work, we consider a general pricing problem that involves constraints on both prices and purchasing probabilities. To tackle the non-concavity challenge, we develop an approximation mechanism that allows solving the constrained static pricing problem through bisection and mixed-integer linear programming (MILP). We further extend the approximation method to the dynamic pricing context. Our approach involves a resource decomposition method to address the curse of dimensionality of the dynamic problem, as well as a MILP approach to solving sub-problems to near-optimality. Numerical results based on generated instances of various sizes indicate the superiority of our approximation approach in both static and dynamic settings.
@article{IJOC_2026,title={Constrained Pricing in Logit-based Revenue Management},volume={},pages={to appear},journal={INFORMS Journal on Computing},author={Shao, Qian and Mai, Tien and Cheng, Shih-Fen},year={2026},doi={10.1287/ijoc.2024.0852},smu={11030},}
Pricing a global product differently across multiple regions is a common but controversial practice. Although price differentiation helps capture unique market characteristics, it also encourages parallel trade, which may affect the overall corporate performance of a global company. We study this problem with a single global business unit (GBU) and multiple local business units (LBUs). The GBU manufactures a product and sets a transfer price for supplying the product to all LBUs, and LBUs decide retail prices for their respective regional markets. Customers can purchase products in any region by comparing LBUs’ prices and other parallel-imported factors, and we construct the demand by a mixed-multinomial logit model. Therefore, each LBU needs to consider the other LBUs’ prices when setting its price, and we formulate this problem as a two-stage game-theoretic model. We verify the existence of a pure-strategy Nash equilibrium. We then develop a learning-based algorithm to find the equilibrium prices of LBUs. Our algorithm is computationally efficient with a large scale of decision-makers. Even for cases with 100 decision-makers, the pure-strategy Nash equilibrium can be obtained within 30 minutes. Numerical studies are conducted using data from the fast-moving consumer products industries. Although parallel trade is detrimental to some LBUs, the existence of parallel trade surprisingly leads to higher overall corporate profits, which is made possible by making the product available at different price points in a market. The proposed method also enables the quantitative validation of several conjectures on parallel trading practices.
@article{AOR_2025,title={Pricing Strategies in Global Channels: Considering the Effects of Parallel Trade},volume={356},number={2},pages={949-983},journal={Annuals of Operations Research},author={Kao, Yuan-Mao and Yang, Yang and Cheng, Shih-Fen and Wu, Cheng-Hung},year={2025},smu={10518},doi={10.1007/s10479-025-06834-y}}
This paper studies integrating the crowd workforce into next-day home delivery services. In this setting, both crowd drivers and contract drivers collaborate in making deliveries. Crowd drivers have limited capacity and can choose not to deliver if the presented tasks do not align with their preferences. The central question addressed is: How can the platform minimize the total task fulfilment cost, which includes payouts to crowd drivers and additional payouts to contract drivers for delivering the unselected tasks by customizing task displays to crowd drivers? To tackle this problem, we formulate it as a finite-horizon Stochastic Decision Problem, capturing crowd drivers’ utility-driven task preferences, with the option of not choosing a task based on the displayed options. An inherent challenge is approximating the non-constant marginal cost of serving orders not chosen by crowd drivers, which are then assigned to contract drivers. We address this by leveraging a common approximation technique, dividing the service region into zones. Furthermore, we devise a stochastic look-ahead strategy that tackles the curse of dimensionality issues arising in dynamic task display execution and a non-linear (problem specifically concave) boundary condition associated with the cost of hiring contract drivers. In experiments inspired by Singapore’s geography, we demonstrate that choice-based crowd shipping can reduce next-day delivery fulfilment costs by up to 16.9%.
The observed cost savings are closely tied to the task display policies and the task choice behaviors of drivers.
@article{EJOR_2025,title={Choice-based Crowdshipping for Next-day Delivery Services: A Dynamic Task Display Problem},volume={328},number={1},pages={336-348},journal={European Journal of Operational Research},author={Arslan, Alp and Kilci, Firat and Cheng, Shih-Fen and Misra, Archan},year={2025},smu={10430},}
Search frictions and misallocation are common in decentralized transportation markets. Using novel trip-level data of taxis in Singapore, this paper examines the impact of real-time demand information at airport terminals on search frictions. The information reduces taxi supply misallocation, increasing deadheading speed by 16.3% and decreasing deadheading time by 10.77%, benefiting both passengers and drivers. It raises daily earnings by $3.70 USD and adds 6.2 minutes of operational time per airport-trip taxi. Spatial spillovers are primarily observed among drivers in adjacent districts. Taxis from the Budget Terminal and drivers with fewer prior airport pickups benefit more from this information.
@article{RESTAT_2025,title={Information Provision and Search Frictions: Evidence from the Taxi Industry in Singapore},volume={},pages={to appear},journal={The Review of Economics and Statistics},author={Agarwal, Sumit and Cheng, Shih-Fen and Keppo, Jussi and Wang, Long and Yang, Yang},year={2025},}
IJCAI-24
Enabling sustainable freight forwarding network via collaborative games
Freight forwarding plays a crucial role in facilitating global trade and logistics. However, as the freight forwarding market is extremely fragmented, freight forwarders often face the issue of not being able to fill the available shipping capacity. This recurrent issue motivates the creation of various freight forwarding networks that aim at exchanging capacities and demands so that the resource utilization of individual freight forwarders can be maximized. In this paper, we focus on how to design such a collaborative network based on collaborative game theory, with the Shapley value representing a fair scheme for profit sharing. Noting that the exact computation of Shapley values is intractable for large-scale real-world scenarios, we incorporate the observation that collaboration among two forwarders is only possible if their service routes and demands overlap. This leads to a new class of collaborative games called the Locally Collaborative Games (LCGs), where agents can only collaborate with their neighbors. We propose an efficient approach to compute Shapley values for LCGs, and numerically demonstrate that our approach significantly outperforms the state-of-the-art approach for a wide variety of network structures.
@inproceedings{tan_ijcai_2024,title={Enabling sustainable freight forwarding network via collaborative games},booktitle={Thirty-Third International Joint Conference on Artificial Intelligence},address={Jeju, Korea},author={Tan, Pang Jin and Cheng, Shih-Fen and Chen, Richard},year={2024},month=aug,smu={9505},}
ICAPS-24
Imitating cost-constrained behaviors in reinforcement learning
Complex planning and scheduling problems have long been solved using various optimization or heuristic approaches. In recent years, imitation learning that aims to learn from expert demonstrations has been proposed as a viable alternative to solving these problems. Generally speaking, imitation learning is designed to learn either the reward (or preference) model or directly the behavioral policy by observing the behavior of an expert. Existing work in imitation learning and inverse reinforcement learning has focused on imitation primarily in unconstrained settings (e.g., no limit on fuel consumed by the vehicle). However, in many real-world domains, the behavior of an expert is governed not only by reward (or preference) but also by constraints. For instance, decisions on self-driving delivery vehicles are dependent not only on the route preferences/rewards (depending on past demand data) but also on the fuel in the vehicle and the time available. In such problems, imitation learning is challenging as decisions are not only dictated by the reward model but are also dependent on a cost-constrained model. In this paper, we provide multiple methods that match expert distributions in the presence of trajectory cost constraints through (a) Lagrangian-based method; (b) Meta-gradients to find a good trade-off between expected return and minimizing constraint violation; and (c) Cost-violation-based alternating gradient. We empirically show that leading imitation learning approaches imitate cost-constrained behaviors poorly and our meta-gradient-based approach achieves the best performance.
@inproceedings{shao_icaps_2024,title={Imitating cost-constrained behaviors in reinforcement learning},booktitle={Thirty-Fourth International Conference on Automated Planning and Scheduling},address={Banff, Alberta, Canada},author={Shao, Qian and Varakantham, Pradeep and Cheng, Shih-Fen},year={2024},month=jun,smu={9496},}
This paper investigates the impact of neighborhood retail amenities on taxi trip behavior. We exploit a natural experiment in Singapore where the government introduced a new policy that allows retail shops to operate in residential neighborhoods. We find that the introduction of retail amenities in residential neighborhoods leads to a significant increase in taxi trips. This increase is driven by the increase in the number of trips to and from the retail amenities. We also find that the increase in taxi trips is more pronounced during the weekends and evenings. Our findings suggest that the introduction of retail amenities in residential neighborhoods can have a significant impact on taxi trip behavior.
@article{lee_habitat_2023,title={Neighborhood retail amenities and taxi trip behavior: A natural experiment in Singapore},volume={131},pages={102714},journal={Habitat International},author={Lee, Kwan Ok and Cheng, Shih-Fen},year={2023},smu={7620},}
We study the role of ride-hailing surge factors on the allocative efficiency of taxis by combining a reduced-form estimation with structural analyses using machine-learning-based demand predictions. Where other research study the effect of entry on incumbent taxis, we use higher frequency granular data to study how location-time-specific surge factors affect taxi bookings to bound the effect of customer decisions while accounting for various confounding variables. We find that even in a unique market like Singapore, where incumbent taxi companies have app-based booking systems similar to those from ride-hailing companies like Uber, the estimated upper bound on the cross-platform substitution between ride-hailing services and taxi bookings is only 0.26. On the other hand, we show that incorporating surge price factor improves the precision of demand prediction by 12% to 15%. Our structural analyses based on a driver guidance system finds this improved accuracy in demand prediction reduces drivers’ vacant roaming times by 9.4% and increases the average number of trips per taxi by 2.6%, suggesting the price information is valuable across platforms, even if elasticities are low.
@article{agarwal_trc_2022,title={The impact of ride-hail surge factors on taxi bookings},volume={136},pages={103508},journal={Transportation Research Part C},author={Agarwal, Sumit and Charoenwong, Ben and Cheng, Shih-Fen and Keppo, Jussi},year={2022},smu={6955},}
The arrangement of last-mile services is playing an increasingly important role in making public transport more accessible. We study the use of ridesharing in satisfying last-mile demands with the assumption that demands are uncertain and come in batches. The most important contribution of our paper is a two-level Markov decision process framework that is capable of generating a vehicle-dispatching policy for the aforementioned service. We introduce state summarization, representative states, and sample-based cost estimation as major approximation techniques in making our approach scalable. We show that our approach converges and solution quality improves as sample size increases. We also apply our approach to a series of case studies derived from a real-world public transport data set in Singapore. By examining three distinctive demand profiles, we show that our approach performs best when the distribution is less uniform and the planning area is large. We also demonstrate that a parallel implementation can further improve the performance of our solution approach.
@article{agussurja_ts_2019,title={A state aggregation approach for stochastic multi-period last-mile ride-sharing problem},volume={53(1)},pages={148--166},journal={Transportation Science},author={Agussurja, Lucas and Cheng, Shih-Fen and Lau, Hoong Chuin},year={2019},smu={4326},}