NUDT-MCM2025 训练测试题NUDT-MCM2025 训练测试Problem.pdf

MCM 2025 Training Contest Problem
MCM 2025 训练赛题

Which is better? Estimating Public Preference for Roads
哪个更好？估计公众对道路的偏好

In today’s era of ubiquitous mobile smart devices, car-pooling, and navigation services have become an integral part of daily traveling. At the heart of these services lies the essential function of route planning - the art of efficiently guiding individuals from their starting point to their intended destination. For providers of route planning services, the challenge goes beyond mere connectivity; it extends to delivering routes that resonate with users, as evidenced by the likelihood of users choosing these recommended paths. Elevated user satisfaction typically correlates with an increased rate of route selections that align with the planned routes, a measure that reflects the efficacy of the route planning algorithm. While traditional route planning leverages weighted graphs and shortest-path algorithms (such as Dijkstra’s algorithm).
在当今移动智能设备无处不在的时代，拼车和导航服务已成为日常出行不可或缺的一部分。这些服务的核心是路线规划的基本功能 - 有效地引导个人从起点到预定目的地的艺术。对于路线规划服务提供商来说，挑战不仅仅是连接;它扩展到提供与用户产生共鸣的路由，用户选择这些推荐路径的可能性就证明了这一点。用户满意度的提高通常与与规划路线一致的路线选择率的增加相关，这一衡量标准反映了路线规划算法的有效性。而传统的路线规划利用加权图和最短路径算法（例如 Dijkstra 算法）。

The key reason for the divergence between shortest paths and user preferences lies in the choice of weightings for the edges in the weighted graph. Intuitive choices like distance or travel time often prove inadequate. Figure 1 of Route Planning with Different Goals illustrates a typical case: given the origin A and destination B, Route 1 represents the popular user choice. Route 2 is the fastest (corresponding to the shortest path from A to B when edge weights are average travel times). Route 3 is the shortest (corresponding to the shortest path when the edge weights are defined as road length). Both Route 2 and Route 3 differ from the user’s popular choice, underscoring the fact that real-world route selection involves more intricate considerations that are hidden within historical data and are challenging to quantify using a simple set of criteria. Expert manual optimization is not only inefficient but also yields suboptimal results.
最短路径和用户首选项之间差异的关键原因在于加权图中边的权重选择。事实证明，距离或旅行时间等直观选择往往是不够的。不同目标的路由规划图 1 说明了一个典型案例：给定源 A 和目标 B，路由 1 表示流行的用户选择。路径 2 是最快的（当边权重为平均行驶时间时，对应于从 A 到 B 的最短路径）。路线 3 是最短的（当边权重定义为道路长度时，对应于最短路径）。Route 2 和 Route 3 都与用户的热门选择不同，这凸显了这样一个事实，即实际的路由选择涉及隐藏在历史数据中的更复杂的考虑因素，并且很难使用一组简单的标准进行量化。专家手动优化不仅效率低下，而且会产生次优的结果。

Figure 1 Examples of Route Planning with Different Goals.
图 1 具有不同目标的路由规划示例。

Hence, existing route planning models are typically data-driven, aiming to harness historical data to achieve more realistic results than straightforward shortest-path algorithms. An effective method is to determine the edge weight of the road network so that most popular paths are the shortest paths between their start and end point on the weighted graph. A car-hailing company hopes that your team can solve the Edge Weight Estimation Problem defined as the following:
因此，现有的路线规划模型通常是数据驱动的，旨在利用历史数据获得比简单的最短路径算法更真实的结果。一种有效的方法是确定道路网络的边权重，以便大多数常用路径是加权图上其起点和终点之间的最短路径。一家网约车公司希望您的团队能够解决定义如下的 Edge Weight Estimation 问题：

Given a weighted, directed graph

G = (V, E)

representing the map of a city.

V

is the vertex set, each vertex

v \in V

represent a crossroad.

E

is the edge set, each edge

(v_{i}, v_{j}) \in E

represent the road segment between

v_{i}

and

v_{j}

. The weight of edge

(v_{i}, v_{j}) \in E

w (v_{i}, v_{j})

which need to be determined based on historical paths.
给定一个加权的有向图

G = (V, E)

，表示城市地图。

V

是顶点集，每个顶点

v \in V

代表一个十字路口。

E

是边集，每条边

(v_{i}, v_{j}) \in E

代表和之间的

v_{i}

v_{j}

路段。edge

(v_{i}, v_{j}) \in E

的权重是

w (v_{i}, v_{j})

需要根据历史路径来确定的。

For some vertexes

v_{s}, v_{t} \in V

, we have historical paths indicating how people used to go from

v_{s}

v_{t}

, denoted as a set

P (s, t) = {p_{1}, p_{2}, \dots, p_{| P (s, t) |}}

where

| P (s, t) |

is the number of paths in

P (s, t)

. Each path

p_{k} \in P (s, t)

from

v_{s}

v_{t}

is represent by some edges. The length of

p_{k}

is the sum of the weights of its edges, denoted as

L (p_{k})

.
对于某些顶点

v_{s}, v_{t} \in V

，我们有历史路径，指示人们过去是如何从到

v_{s}

v_{t}

的，表示为一个集合

P (s, t) = {p_{1}, p_{2}, \dots, p_{| P (s, t) |}}

，其中

| P (s, t) |

是中的

P (s, t)

路径数。从到的

v_{s}

v_{t}

每条路径

p_{k} \in P (s, t)

都由一些边缘表示。的长度

p_{k}

是其边的权重之和，表示为

L (p_{k})

。

When the edge weights are determined, for each

(s, t)

pair, we can find the length of the shortest path from

s

t

(Also called the distance from

s

t

), denoted as

D (s, t)

. We can also calculate

L (p_{k})

for

\forall p_{k} \in P (s, t)

based on the determined edge weights. For each

p_{k} \in P (s, t)

, define a similarity function

f (p_{k}) = {\begin{cases} 1 & if L (p_{k}) = D (s, t) \\ 0 & if L (p_{k}) > D (s, t) \end{cases}

.
当确定了边权重时，对于每

(s, t)

对，我们可以找到从

s

t

的最短路径的长度（也称为从

s

t

的距离），表示为

D (s, t)

。我们还可以根据确定的边权重进行计算

L (p_{k})

\forall p_{k} \in P (s, t)

。对于每个

p_{k} \in P (s, t)

，定义一个相似性函数

f (p_{k}) = {\begin{cases} 1 & if L (p_{k}) = D (s, t) \\ 0 & if L (p_{k}) > D (s, t) \end{cases}

。

Our goal is to maximize the overall similarity

SIM = \frac{\sum_{s, t \in V} \sum_{p_{k} \in P (s, t)} f (p_{k})}{\sum_{s, t \in V} | P (s, t) |}

by assigning weights to the edges in the graph G. For simplicity, we restrict the weights of the edges to be positive integers. Example 1 shows the calculation of SIM for a given graph, historical paths, and already determined the edge weight.
我们的目标是通过为图 G 中的边分配权重来最大化整体相似性

SIM = \frac{\sum_{s, t \in V} \sum_{p_{k} \in P (s, t)} f (p_{k})}{\sum_{s, t \in V} | P (s, t) |}

。为简单起见，我们将 theweightsofthe edges 限制为正整数。示例 1 显示了给定图形、历史路径和已确定的边缘权重的 SIM 计算。

Example 1: Given a graph

G = (V, E)

, Vertex set

V = {1, 2, 3, 4, 5, 6, 7, 8, 9}

, Edge Set

E =

{(1, 2), (1, 4), (2, 3), (2, 5), (3, 6), (4, 5), (4, 7), (5, 4), (5, 6), (5, 8), (6, 5), (6, 9), (7, 8), (8, 9), (9, 8)}

示例 1：给定一个图形

G = (V, E)

、顶点集

V = {1, 2, 3, 4, 5, 6, 7, 8, 9}

、边集

E =

{(1, 2), (1, 4), (2, 3), (2, 5), (3, 6), (4, 5), (4, 7), (5, 4), (5, 6), (5, 8), (6, 5), (6, 9), (7, 8), (8, 9), (9, 8)}

Historical paths set

P (1, 7) = {p_{1}, p_{2}}, P (1, 9) = {p_{3}, p_{4}}

, Where

p_{1} =< (1, 2), (2, 5), (5, 4), (4, 7) >

p_{2} =< (1, 2), (2, 5), (5, 8), (8, 7) >, p_{3} =< (1, 2), (2, 5), (5, 6), (6, 9) >, p_{4} =< (1, 2), (2, 3), (3, 6), (6, 5)

(5, 8), (8, 9) >

. If we determine the edge weights as Table 1.
历史路径集

P (1, 7) = {p_{1}, p_{2}}, P (1, 9) = {p_{3}, p_{4}}

，其中

p_{1} =< (1, 2), (2, 5), (5, 4), (4, 7) >

，

p_{2} =< (1, 2), (2, 5), (5, 8), (8, 7) >, p_{3} =< (1, 2), (2, 5), (5, 6), (6, 9) >, p_{4} =< (1, 2), (2, 3), (3, 6), (6, 5)

，，

(5, 8), (8, 9) >

.如果我们确定边缘权重，如表 1.

Table 1: Edge Weights Assignment of Graph

G

表 1：图形

G

的边缘权重分配

$v_{i}$	1	1	2	2	3	4	4	5	5	5	6	6	7	8	8	9
$v_{j}$	2	4	3	5	6	5	7	4	6	8	5	9	8	7	9	8
$w (v_{i}, v_{j})$	1	4	1	1	1	1	1	3	3	1	1	1	1	1	1	1

Weights Assignment 1, SIM=0.25
权重分配 1，SIM=0.25

Weights Assignment 2, SIM=0.5
权重分配 2，SIM=0.5

Then we can calculate SIM as follows:
那么我们可以按如下方式计算 SIM：
One of the shortest paths from 1 to 7 is

1 \to 2 \to 5 \to 8 \to 7

so that

D (1, 7) = 4

.
从 1 到 7 的最短路径之一是

1 \to 2 \to 5 \to 8 \to 7

D (1, 7) = 4

。

L (p_{1}) = 6 > D (1, 7), L (p_{2}) = 4 = D (1, 7)

, then

f (p_{1}) = 0, f (p_{2}) = 1

L (p_{1}) = 6 > D (1, 7), L (p_{2}) = 4 = D (1, 7)

然后

f (p_{1}) = 0, f (p_{2}) = 1

One of the shortest paths from 1 to 9 is

1 \to 2 \to 3 \to 6 \to 9

so that

D (1, 9) = 4

.
从 1 到 9 的最短路径之一是

1 \to 2 \to 3 \to 6 \to 9

D (1, 9) = 4

。

L (p_{3}) = 6 > D (1, 9), L (p_{4}) = 6 > D (1, 9), then f (p_{3}) = f (p_{4}) = 0

SIM = \frac{f (p_{1}) + f (p_{2}) + f (p_{3}) + f (p_{4})}{4} = 0.25

We can improve the SIM value in Example 1 if we revise one edge weight.
如果我们修改一个边权重，可以提高示例 1 中的 SIM 值。
same as that in Example 1. Then we can calculate SIM as follows:
同例 1 中。那么我们可以按如下方式计算 SIM：
One of the shortest paths from 1 to 7 is

1 \to 2 \to 5 \to 8 \to 7

so that

D (1, 7) = 4

. Let

w (5, 6) = 1

, others are the same as that in Example 1. Then we can calculate SIM as follows:
从 1 到 7 的最短路径之一是

1 \to 2 \to 5 \to 8 \to 7

D (1, 7) = 4

。令

w (5, 6) = 1

，其他的与示例 1 中的相同。那么我们可以按如下方式计算 SIM：

L (p_{1}) = 6 > D (1, 7), L (p_{2}) = 4 = D (1, 7), then f (p_{1}) = 0, f (p_{2}) = 1

One of the shortest paths from 1 to 9 is

1 \to 2 \to 3 \to 6 \to 9

so that

D (1, 9) = 4

.
从 1 到 9 的最短路径之一是

1 \to 2 \to 3 \to 6 \to 9

D (1, 9) = 4

。

\begin{aligned} L (p_{3}) = 4 = D (1, 9), L (p_{4}) & = 6 > D (1, 9), then f (p_{3}) = 1, f (p_{4}) = 0 \\ SIM = \frac{f (p_{1}) + f (p_{2}) + f (p_{3}) + f (p_{4})}{4} & = 0.5 \end{aligned}

Your task is to develop model(s) to address the following problems:
您的任务是开发模型来解决以下问题：
Problem 1: What is the maximum SIM of Example 1? Give the corresponding weights like Table 1.
问题 1：示例 1 的最大 SIM 卡是多少？给出相应的权重，如表 1。
Problem 2: If we set the weight as the length of the edge, please calculate the SIM value of Case 1. (All edge lengths in the input data are positive integers)
问题 2：如果我们将 weight 设置为边的长度，请计算情况 1 的 SIM 值。（输入数据中的所有边长均为正整数）

method on Case 1 and show the results.Problem 3: Given an arbitrary graph, please design a method to determine the edge weights that maximize SIM. Apply your method on Case 1 and show the results.
方法并显示结果。问题 3：给定一个任意图，请设计一种方法来确定最大化 SIM 的边权重。在案例 1 上应用您的方法并显示结果。

Problem 4: What is the difference between your estimated weights and the edge length? Show the advantage of your weights by analyzing some cases. (Hint: You may visualize some results on the map)
问题 4：您的估计重量和边缘长度有什么区别？通过分析一些案例来展示 yourweights 的优势。（提示：您可以在地图上可视化一些结果）

Problem 5: A one-two-page memo summarizing your researches and results with advice for executives of the car-hailing company.
问题 5：一份一两页的备忘录，总结您的研究和结果，并为网约车公司的高管提供建议。

MCM 2025 Training Contest ProblemMCM 2025 训练赛题

Which is better? Estimating Public Preference for Roads哪个更好？估计公众对道路的偏好

MCM 2025 Training Contest Problem
MCM 2025 训练赛题

Which is better? Estimating Public Preference for Roads
哪个更好？估计公众对道路的偏好