370_2617318598.pdf

Embed Size (px)

Citation preview

  • 7/27/2019 370_2617318598.pdf

    1/10

    19

    Transportation Research Record: Journal of the Transportation Research Board,

    No. 1985, Transportation Research Board of the National Academies, Washington,

    D.C., 2006, pp. 1928.

    An algorithm to solve explicitly the path enumeration problem is pro-

    posed. This algorithm is based on the branch-and-bound technique

    and belongs to the class of deterministic methods along with existing

    approaches that combine heuristic or randomization procedures with

    shortest-path search. The branch-and-bound algorithm is formulated,

    and a methodology is designed for the application of deterministic ap-

    proaches to a real case study. Path sets generated with different methods

    are compared for behavioral consistency, namely, the ability to reproduce

    actual routes chosen by individuals driving habitually from home to

    work. Choice set compositions for modeling purposes are determined for

    the consistency of the path generation process with the observed behav-

    ior. Further, model estimates and performance for different route choice

    specifications are examined for both path set compositions. Results sug-

    gest that the proposed branch-and-bound algorithm generates realistic

    and heterogeneous routes, reproduces better the observed behavior of the

    interviewed drivers, and produces a good choice set for route choice

    model estimation and performance comparison.

    Route choice behavior modeling generally considers separately the

    individuation of available alternative routes and the calculation of the

    probability of choosing a certain route from the generated choice set.

    Enumeration of alternatives is not a straightforward problem

    since the actual size of real networks creates problems in the choice

    set definition. The generated set should exclude unrealistic pathsthat no traveler would ever consider and highly similar paths that no

    traveler would ever differentiate between and include relevant and

    heterogeneous routes that different travelers would choose.

    Several approaches found in the literature to generate routes are

    based on variations to the shortest-path search. Definition of objective

    functions to be minimized, formulation of heuristic rules, and imple-

    mentation of randomization techniques are combined with K-shortest-

    path algorithms in order to create a path set. The analysis of relevance

    and heterogeneity of paths generated with methods relying on the

    shortest-path search suggests a different approach to the problem.

    An algorithm based on the branch-and-bound technique is pro-

    posed in this study, which intends to explicitly generate realistic and

    heterogeneous routes while limiting computational costs. This proce-

    dure constructs a connection tree between the origin and destination

    of a trip by processing sequences of links according to a branching

    rule that accounts for logical constraints formulated to increase route

    likelihood and heterogeneity. Each sequence of links connecting ori-

    gin and destination and satisfying all the constraints enters the choice

    set as a feasible solution to the path enumeration problem.

    The application of the branch-and-bound algorithm to a real case

    study is illustrated here. The procedure is compared with different

    deterministic techniques with respect to the ability to reproduce actual

    route choices of individuals driving habitually from home to work in

    an urban network. A choice set for modeling purposes resulting from

    the application of the method is constructed by considering path set

    consistency with the observed behavior and is used for comparing

    estimates and performance of different model specifications.

    DETERMINISTIC APPROACHESFOR PATH GENERATION

    The exhaustive approach to path generation assumes unrealistically

    that all physical routes connecting origin and destination of a trip are

    considered. Selective approaches account for deterministic and prob-

    abilistic procedures, depending on the methods used to generate the

    paths. The proposed branch-and-bound algorithm enters the class

    of deterministic approaches reviewed in this study. A review of the

    probabilistic methods for choice set generation may be found else-

    where (1). Prashker and Bekhor review the route choice models

    estimated for comparing the different generated choice sets (2).

    The most straightforward path generation approach searches for

    the first K-shortest paths that minimize the generalized path costs.

    Shortest-path algorithms, for example, that of Dijkstra (3), assumeimplicit awareness of all the link attributes. Van der Zijpp and

    Fiorenzo-Catalano (4) present a constrained method to generate

    routes by finding directly feasible K-shortest paths and exploiting

    a wide class of constraints.

    Ben-Akiva et al. (5) propose an approach for generating possible

    paths by labeling each route according to a criterion for which the

    path is optimum. This approach assumes that travelers may have dif-

    ferent objective functions. Each criterion corresponds to a different

    preferred route, and each route can be labeled according to a differ-

    ent objective function. Dial (6) generalizes the labeling method by

    constructing a set of efficient paths in which being efficient means

    minimizing a linear combination of label costs.

    Azevedo et al. (7) define an approach in which all the shortest-path

    links are removed from the network to find the next best path. Themain problem with the link elimination approach is related to net-

    work disconnection, since removing centroid connectors and major

    junctions does not guarantee the existence of more paths between the

    origin and the destination. A variant to this approach obviates the

    problem by eliminating individual links or a combination of links

    from the shortest path.

    De la Barra et al. (8) illustrate the link penalty approach, in which

    impedances of the shortest-path links are increased in order to cal-

    culate the next best path. The process continues until no more new

    paths are produced. Park and Rilett (9) modify this approach by not

    increasing the impedance on links within a certain distance from the

    Applying Branch-and-Bound Techniqueto Route Choice Set Generation

    Carlo Giacomo Prato and Shlomo Bekhor

    Transportation Research Institute, TechnionIsrael Institute of Technology, Haifa

    32000, Israel.

  • 7/27/2019 370_2617318598.pdf

    2/10

    origin or the destination. Scott et al. (10) optimize the program for

    determining the penalizing factor for impedances on shortest-path

    links in order to generate a next best path that overlaps with the

    shortest path by no more than a given number of links.

    Simulation methods assume that travelers erroneously perceive

    link attributes, and therefore extraction of random draws from a dis-

    tribution that might represent drivers perceptions appears to be

    suitable. Sheffi and Powell (11) apply the Monte Carlo technique

    with multinomial probit to the traffic assignment problem. Since theassignment algorithm does not present the network loading steps,

    the application presents analogies with path generation procedures.

    Fiorenzo-Catalano and Van der Zijpp (12) implement a version of the

    Monte Carlo technique by gradually increasing the variance of the

    random components in the model in order to keep the frequency with

    which new paths are found at a constant rate. Bekhor et al. (13) ver-

    ify the suitability of simulation methods to produce paths similar to

    those observed for a real case study in Boston.

    The aspect common to the foregoing approaches is the shortest-path

    search. The method proposed in this study provides an alternative to

    the shortest-path-based methods by applying the branch-and-bound

    technique to solve the path enumeration problem.

    In transportation-related problems, Friedrich et al. (14) apply a

    branch-and-bound assignment procedure for transit networks byusing a timetable-based search algorithm. Hoogendoorn-Lanser (15)

    adapts the same procedure for choice set generation in the analysis

    of multimodal transport networks. In these transit applications, the

    method exploits predefined route sections. In the following section,

    a different approach is designed for a road network.

    BRANCH-AND-BOUND ALGORITHM

    FOR PATH GENERATION

    The branch-and-bound algorithm enumerates paths by generating a

    tree of routes connecting an origin node o to a destination node d.

    The preprocessing phase of the algorithm creates arrays containing

    the network elements to be processed:

    An array lists the generic linksLi,j by tracing initial node i, finalnodej, predecessors ofi, successors ofj, lengthD(Li,j), travel time

    T(Li,j), and straight linear distanceDj,dfrom the final nodej of the link

    to the destination node d; Defining a path segment Po,kas the sequence of links connecting

    the origin node o to the initial node kof the next link processed, an

    array records path segments Po,kby registering lengthD(Po,k), travel

    time T(Po,k), number of left turns LT(Po,k), and straight linear distance

    Dk,dfrom the final node kof the path segment to the destination node

    d; and

    An array catalogs the set Ck of path segments Po,k arriving atnode k.

    The processing phase of the algorithm represents the centroid node

    of the origin zone as the root of the tree and the centroid connector

    to the origin node o as the only outgoing branch.

    Starting from the origin node o, given a path segment Po,x from the

    current tree level, all possible successors of nodex are considered. Let

    L*x,y be the currently processed link starting at nodex and terminating

    at nodey. Let P*o,y be the processed path segment between origin node

    o and nodey, formed by adding the linkL*x,y to the path segment Po,xarriving at nodex. Let Cy be the set of all the connections to nodey.

    L*x,y is inserted into the tree as an appendix of the connection set C*x if

    and only if all the following conditions hold:

    20 Transportation Research Record 1985

    Directional constraint. Consider the straight linear distanceD*y,dfrom the final nodey of the processed linkL*x,y to the destination d

    and the straight linear distanceD*x,d from the arriving nodex of the

    processed connection set C*x to the destination d. The processed link

    enters the connection set Cy if

    where D is a distance factor larger than 1. This directional con-straint excludes from consideration links that take the driver sig-

    nificantly farther from the destination and closer to the origin. The

    distance factor introduces a tolerance because it considers drivers

    that move slightly farther from the destination to reach a faster

    road, for example, a highway.

    Temporal constraint. Consider the travel time T(P*o,y) of theprocessed path segment P*o,y and the minimum travel time Tmin(Po,y)

    among the path segments belonging to the connection set Cy. The

    connection set Cy includes the processed path segment if

    where T is a time factor larger than 1. This temporal constraintrejects path segments that travelers would consider unrealistic sincetheir travel time is excessively high for connecting the origin node o

    to the arrival nodey.

    Loop constraint. Consider each linkL*i,j belonging to the set of

    links {L*o,(o+1),L*(o+1),(o+2), . . . ,L*(o+n),y} that form the processed path seg-

    ment P*o,y. Define the functionsNs(Li,j) andNe(Li,j) that map the initial

    and the final node of each linkLi,j. Consider a subsequence of links

    {L*r,(r+1), L*(r+1),(r+2), . . . , L*(r+m),s} that compose a path subsegment P*r,sconnecting two generic nodes rand s on the processed path segment

    P*o,y. Consider the length D(L*r,s) of each link belonging to the path

    subsegment P*r,s and the minimum distance D[Ns(L*r,(r+1)),Ne(L*(r+m),s)]

    between nodes rand s. The path segment P*o,y is excluded from the

    connection set Cy if there exists at least one path subsegment P*r,sfor which

    where L is a detour factor larger than 1. This loop constraint dis-cards path segments that travelers would not consider because they

    constitute a detour larger than an acceptable value.

    Similarity constraint. Considering the same definitions as forthe previous constraint, the path segment P*o,y is not taken into con-

    sideration in the connection set Cy if there exists at least one path

    subsegment P*r,s for which

    where o is an overlap factor smaller than 1. This similarity con-straint removes highly overlapping path segments that travelers

    would not consider as separate alternatives. The sequence of links

    {L*r,(r+1),L*(r+1),(r+2), . . . , L*(r+m),s} is actually a detour and the overlap

    factor is the minimum required detour length in order to individuate

    a different path segment.

    D L D N L N Lr sP

    s r r e r m s

    r s

    ,*

    ,*

    ,*

    ,*

    ,( ) > ( ) ( ) +( ) +( )1

    ( ) ( )( )

    +( ) +( ) 1 ss r s o yP P* ,* ,* ( )( ) 3

    T P T Po y T o y,*

    min ,( )( ) ( ) 2

    D Dy d D x d,*

    ,

    * ( ) 1

  • 7/27/2019 370_2617318598.pdf

    3/10

    Movement constraint. Consider the number of left turns LT(P*o,y)in the processed path segment P*o,y. The connection set Cy includes the

    processed path segment if

    where LT is the maximum number of left turns for each route. Espe-cially in urban networks where traffic light regulation does not reserve

    green time for left turns, these movements markedly increase traveltime by adding long waiting times at junctions before a left turn. This

    movement constraint removes unrealistic path segments, causing

    delay in terms of travel time and apprehension in drivers approaching

    the junction.

    The algorithm processes a tree level completely before path seg-

    ments of the next level are considered. Two queues store unprocessed

    segments of the current and the next level, and the level is completed

    when the queues are empty. The algorithm completes the connection

    search when all the tree levels are processed and nodey corresponds

    to destination node dfor all the branches.

    Figure 1 outlines the structure of the generic connection tree. The

    tree width depends on the network structure and may be wider than the

    usual shortest-path tree. According to operational research theory,the speed of the branch-and-bound algorithm depends linearly on

    the width of the connection tree but exponentially on its depth. Con-

    sequently the proposed technique behaves conveniently from the

    computational perspective.

    APPLICATION OF PATH

    GENERATION ALGORITHMS

    For modeling purposes, choice sets are considered consistent with the

    observed behavior if they contain the actual chosen route among paths

    produced with generation techniques. For this reason, different deter-

    LTLTPo y,

    * ( )( ) 5

    Prato and Bekhor 21

    ministic approaches to path enumeration are tested with respect to the

    ability to reproduce the observed behavior of individuals driving

    habitually from home to work in an urban network.

    Actual route choices were collected among faculty and staff mem-

    bers of Turin Polytechnic, in Italy, who participated voluntarily in a

    Web-based survey. Prato et al. (16) describe the details of the ques-

    tionnaire, which in the first part collected information about spatial

    abilities and driving attitudes of the respondents and in the second

    part recorded the chosen route and possible alternatives to reach theirworkplace. The application of path generation algorithms exploits the

    network of the city of Turin, built on the basis of the urban traffic plan

    designed by the municipality in 2001 and composed of 419 nodes

    and 1,427 links (17).

    Different path generation methods are evaluated with respect to the

    coverage of the collected routes. The coverage is the percentage of

    observations for which an algorithm generates a route that satisfies a

    threshold for the overlap measure:

    whereI() is the coverage function, equal to 1 when its argument is

    true and zero when its argument is false; Onris the overlap percentage;and is a threshold for the overlap measure.

    The overlap measure evaluates the consistency of a path enumer-

    ation method with respect to the observed behavior by considering

    the length of the links shared between generated and collected routes:

    whereLnr is the overlapping length between the path generated by

    algorithm rand the observed path for driver n, andLn is the length

    of the observed path for driver n.

    OL

    Lnr

    nr

    n

    = ( )7

    max ( )r

    nr

    n

    N

    I O ( )=

    1

    6

    tree width

    node d

    8

    7

    2

    1

    3

    4

    5

    6

    no e

    node d

    node d

    node d

    node d

    node d

    node d

    node d

    node d

    node d

    node o

    level 1 level 2 level 3 level 4 level 5

    tree depth

    FIGURE 1 Structure of generic connection tree.

  • 7/27/2019 370_2617318598.pdf

    4/10

    The ideal algorithm would reproduce perfectly the observed

    behavior by replicating link by link all the routes collected in the sur-

    vey and would result in 100% coverage for a 100% overlap thresh-

    old. The actual techniques partially reproduce the observed behavior,

    and an index measures the behavioral consistency of path generation

    methods with respect to the ideal algorithm by accounting for total

    overlap over all the observations:

    where CIris the consistency index of algorithm r, Onr,max is the max-

    imum overlap measure obtained with the paths generated by algo-

    rithm rfor the observed choice of each driver n, and Omax is the 100%

    overlap over all theNobservations for the ideal algorithm.

    Five different deterministic algorithms are applied with the

    objective of maximizing the coverage function:

    The branch-and-bound algorithm is implemented by definingthe parameters for the constraints of the branching rule: the dis-

    tance factor is 1.10, the time factor is 1.50, the loop factor is 1.20,

    the overlap factor is 0.80, and the maximum number of left turns

    is 4. The first factor introduces tolerance with respect to links that

    take the driver farther from the destination. The following three

    factors guarantee heterogeneity among paths, since routes that are

    too similar and overcircuitous are excluded from the generated

    path set.

    The labeling approach is applied by calculating the shortestpath with respect to route attributes such as distance, free-flow time,

    travel time, and delay, which measure the level of congestion by

    evaluating the difference between travel times in congested and in

    free-flow conditions.

    The link elimination approach is modified from the original for-

    mulation described by Azevedo et al. (7) by repeating for 10 itera-tions the following three-step method: (a) computation of the shortest

    path by considering the travel time, (b) elimination of a link belong-

    ing to the current shortest path, and (c) computation of the next-

    shortest path. Shortest-path links are eliminated if they take the driver

    farther from the destination or compel the driver to turn from a high

    hierarchical road to a low hierarchical road.

    The link penalty approach is adapted from the original methodproposed by De la Barra et al. (8) by replicating for 15 iterations

    the following three-step procedure: (a) computation of the shortest

    path by considering the travel time, (b) penalization of the links

    belonging to the current shortest path with a factor equal to 5% of

    the travel time, and (c) computation of the next-shortest path. The

    penalizing factor is a compromise between a low value for which

    the same path is identified repeatedly and a high value for whichlonger paths are generated before routes that are more similar to the

    shortest path.

    Two simulation approaches are implemented by computing theshortest path for each draw of impedances of the links belonging to

    the network. The two approaches exploit the same procedure to draw

    impedances from a truncated normal distribution characterized from

    the following parameters: (a) mean equal to the travel time, (b) vari-

    ance equal to a percentage of the mean, (c) left truncation limit equal

    to the free-flow time, and (d) right truncation limit equal to the travel

    time calculated for a minimum speed assumed equal to 10 km/h. The

    first simulation approach sets the variance equal to 20% of the mean

    CIr

    nr

    n

    N

    O

    N O= =

    ,maxmax

    ( )1 8

    22 Transportation Research Record 1985

    and extracts 25 draws. The second simulation approach defines the

    variance equal to the mean and extracts 35 draws.

    For all algorithms based on the shortest-path search, the number

    of iterations or draws is determined by the asymptotically decreasing

    ability of each technique to generate unique routes with an increas-

    ing number of repetitions, and the postprocessing step consists of the

    elimination of duplicated paths.

    The generated path sets are compared with respect to the observed

    routes, and the resulting coverage measures the goodness of fit of each

    method. Considering the number of reproduced routes and the differ-

    ent nature of each technique and looking for consistency of path sets

    with the observed behavior, two different choice sets are built con-

    sidering the same reproduced observations: the first contains all the

    paths generated by the approaches relying on the shortest-path search,

    and the second consists of all the paths generated by the branch-and-

    bound algorithm. Six route choice models are estimated and compared

    within the different choice sets: multinomial logit (MNL), C-logit,

    path-size logit (PSL), generalized-nested logit (GNL), cross-nested

    logit (CNL), and link-nested logit (LNL).

    RESULTS OF PATH GENERATION ALGORITHMS

    The application of the described methodology allows comparison

    of coverage results from path sets generated with different methods

    and model performance within different choice sets.

    Coverage Measures of Path

    Generation Algorithms

    The database of the Web-based survey responses contains 276 ob-

    servations. Incomplete observations, presenting either missing val-

    ues in the attitudinal section or any incorrectly coded route in the

    path-recording section, are excluded from consideration.Of the 236 observations entering the clean data set 90% contain

    information not only about the chosen route but also about the alter-

    native routes considered for reaching the workplace. A total of 236

    actual chosen routes and 339 possible alternatives, covering 182 dif-

    ferent origindestination pairs, constitute the comparison term for

    measuring the coverage of the path generation algorithms described

    in the methodological section.

    Table 1 presents coverage results according to different overlap

    thresholds varying from complete replication to the reproduction of

    70% of the collected routes.

    Each of the single labels performs weakly, which suggests that

    the actual behavior of habitual drivers does not usually correspond to

    the shortest-path selection. The analysis of the combined effect of the

    four labels shows that only 40% of the chosen routes are replicated,

    whereas almost 45% are reproduced with 80% overlap threshold.

    The link elimination approach duplicates almost 60% and covers

    almost 70% of the observed routes with an 80% overlap threshold.

    It also outperforms the link penalty and simulation techniques, with

    a small variance for the probability distribution. The simulation

    method with large variance surpasses the thresholds of 60% of the

    replicated routes and 70% of at least 80% overlapping routes.

    The branch-and-bound algorithm largely outperforms each single-

    generation algorithm by replicating over 90% and reproducing over

    96% of the observed routes. The branch-and-bound algorithm also

    slightly outperforms the path set resulting from the combination of all

  • 7/27/2019 370_2617318598.pdf

    5/10

    Prato and Bekhor 23

    null overlap, mainly among the alternatives rather than among the

    chosen paths and mainly for shortest-path-based techniques rather

    than for the branch-and-bound algorithm.

    Behavioral consistency constitutes the most desirable property of

    the generation techniques, but providing guidelines for selecting the

    preferred method should consider the trade-off with computational

    performance. Table 2 shows the computational costs for the tech-

    niques applied to this case study and allows examination of the

    trade-off between implementation time and consistency.For each origindestination pair, minimizing each label requires

    time for shortest-path calculation. The link elimination and link pe-

    nalty approaches demand longer times for rewriting the network by

    removing one link or updating costs at each iteration. The simulation

    approaches require additional time for drawing random travel times

    before the shortest path is calculated. The branch-and-bound method

    exploits partial path subsegments calculated for different origin

    destination pairs, with consequent reduction of computational costs

    while additional origindestination pairs are processed.

    The labeling approach shows excessive inconsistency with respect

    to the observed behavior and provides limited increase in coverage

    with additional labels, and consequently further implementation of the

    method appears inadvisable. The link penalization, link elimination,

    and simulation techniques present asymptotical behavior with respectto the ability to produce unique routes and consequently additional

    iterations give reduced gain in coverage.

    The branch-and-bound technique gives excellent results in terms

    of coverage, computational costs similar to those of techniques

    largely outperformed, and shows a good trade-off between behav-

    ioral consistency and implementation time. For example, with time

    comparable with that of the simulation method with small variance,

    the 40% coverage increase of replicated routes indicates that the

    proposed algorithm performs excellently.

    Choice Set Composition for Model Estimation

    Reproduction of the actual chosen route indicates the consistency ofeach algorithm with respect to the observed behavior of each respon-

    dent. Inconsistent observations for which path enumeration methods

    fail to reproduce the actual choice are excluded from choice sets pre-

    pared for model estimation.

    To visualize the consistency of the applied algorithms regarding the

    observed behavior, Figure 2 represents the distribution of coverage

    over the cumulative percentage of observations.

    The ideal algorithm would replicate all the observed routes and

    draw a horizontal line at the 100% overlap threshold. The area below

    TABLE 1 Coverage Resu lts of Appli ed Alg or ith ms

    Overlap Threshold

    Algorithm 100% 90% 80% 70%

    Chosen routes

    Labeling approach 26.69 26.69 30.93 34.32(least length)

    Labeling approach 26.27 26.27 27.54 29.24(least free flow time)

    Labeling approach 17.80 17.80 18.22 22.46(least travel time)

    Labeling approach 21.19 21.19 22.88 23.73(least delay)

    Link elimination 58.47 58.47 69.92 81.78approach

    Link penalty approach 53.81 53.81 62.29 68.22

    Simulation with small 49.15 49.15 54.24 59.32variance

    Simulation with large 61.44 61.86 71.19 81.36variance

    Branch-and-bound 91.10 91.53 96.61 97.88algorithm

    Alternative routesLabeling approach 13.27 13.57 22.42 26.25

    (least length)

    Labeling approach 21.53 21.53 22.42 30.68(least free flow time)

    Labeling approach 15.93 15.93 16.22 19.17(least travel time)

    Labeling approach 17.40 17.40 17.99 20.35(least delay)

    Link elimination 54.87 54.87 66.37 76.11approach

    Link penalty approach 43.95 43.95 51.62 61.65

    Simulation with small 42.77 42.77 48.08 61.06variance

    Simulation with large 55.16 55.16 64.90 76.11variance

    Branch-and-bound 82.60 82.60 89.38 95.58algorithm

    TABLE 2 Compu tatio na l Performance of App lied Algor ithms

    Algorithm Computational Time Number of Unique Routes Consistency Index

    Labeling approach (length) 1.5 h 182 53.54

    Labeling approach (free flow time) 1.5 h 182 49.36

    Labeling approach (travel time) 1.5 h 182 43.34

    Labeling approach (delay) 1.5 h 182 44.46

    Link elimination approach 36 h 958 87.16

    Link penalty approach 54 h 1164 81.29

    Simulation with small variance 42 h 1097 75.49

    Simulation with large variance 58 h 3305 88.12

    Branch-and-bound method 40 h 2038 97.91

    NOTE: Computations performed using Borland Pascal and Microsoft Excel XP on an Intel Pentium IV 3.06 GHz with 512 MbRAM running Windows XP Home.

    the other methods since the coverage of the merged path set duplicates

    87% of the routes and covers the 95% with an 80% overlap threshold.

    Results show the tendency of all algorithms to perform better

    with respect to the actual choices than to the possible alternatives

    considered by the respondents. More than 700 generated routes are

    completely inconsistent with the observed behavior by presenting

  • 7/27/2019 370_2617318598.pdf

    6/10

    each line representing the distribution of the coverage measures the

    consistency of each algorithm with respect to the rectangular area

    that the ideal algorithm would individuate. The consistency of the

    applied methods varies from 67.2% for the labeling approach to

    97.9% for the branch-and-bound method.

    With the objective of constructing choice sets for model estimation,

    the 80% overlap threshold individuates the observations to be in-

    cluded for each algorithm. With the objective of increasing the relia-

    bility of choice set comparison by considering the same observations

    and a high number of observations, two choice sets are constructed.

    The first choice set merges the path sets generated with the algo-

    rithms based on the shortest-path search, since each single algorithm

    reproduces a limited number of observations. The second choice set

    corresponds to the path set obtained with the branch-and-bound tech-

    nique. Both choice sets account for the same 223 observations and

    results consistent with the same observed behavior.

    Figure 3 illustrates differences between the two choice sets in

    terms of number of alternatives and number of links for each obser-

    vation. For the choice set obtained by merging paths generated with

    different algorithms, the median size counts 32 routes, more than

    one quarter of the observations contain at least 40 paths, and the

    maximum number of alternatives reaches 55. For the choice set

    generated with the branch-and-bound technique the median size is

    17 routes, only 6% of the observations consist of 40 paths or more,

    and the maximum number of alternatives is 44.

    Both choice sets include a high number of alternatives and pre-

    sumably contain routes that drivers would not consider. Considering

    the number of links for each observation with respect to the merged

    path set, the branch-and-bound path set presents a higher ratio be-

    tween links and routes. This finding indicates that the paths share

    fewer links and are most likely to be more heterogeneous.

    24 Transportation Research Record 1985

    Route Choice Model Estimation

    Data sets for model estimation account for

    Level-of-service variables, such as distance, free-flow time, andtravel time;

    Landmark dummy variables, equal to 1 if the route crosses thelandmark and zero otherwise; and

    Behavioral variables, measured at the individual level byapplying factor analysis to the behavioral indicators (16).

    In contrast to the MNL model, different model specifications

    account for similarity among alternatives and require the estima-

    tion of additional parameters. The C-logit and PSL models main-

    tain the logit structure by including a correction term within the

    deterministic part of the utility function. The following formula-

    tions for commonality factor and path size are applied for C-logit

    and PSL estimation (18, 19):

    where

    CFk = commonality factor of route k,Lk = length of route k,Ll = length of each route l in choice set Cn,

    Lkl = common length between routes kand l, and0 = 1.

    CFkkl

    k ll C

    k l

    k kl

    t

    L

    L L

    L L

    L Ln

    = +

    0 1lnkkl

    ( )9

    0

    10

    20

    30

    40

    50

    60

    70

    80

    90

    100

    0 20 40 60 80

    cumulative percentage of the observations

    overlapthreshold

    100

    branch & bound simulation with large variance simulation with small variance

    link elimination link penalty labeling approach

    FIGURE 2 Distribution of coverage over 236 observations.

  • 7/27/2019 370_2617318598.pdf

    7/10

    where

    PSk = path size of route k,k = set of links of route k,La = length of linka,al = linkpath incidence dummy (equal to 1 if route l uses link

    a, and zero otherwise), and

    = positive parameter.

    Generalized nested structures relate the model coefficients to the

    network topology for adaptation to route choice. The GNL, CNL,

    and LNL models consider the following functional relationship for

    the inclusion coefficients (20):

    where

    mk = inclusion coefficients (with 0 mk 1),Lm = length of linkm,Lk = length of route k, andmk = linkpath incidence dummy (equal to 1 if route kuses link

    m and zero otherwise).

    For GNL estimation, nesting coefficients are considered unique

    for each nest and are expressed with the following parameterized

    formulation (21):

    mkm

    k

    mk

    L

    L= ( )11

    PSkq

    k k

    t

    al

    l C

    a

    L

    L L

    Ln

    k

    =

    1 10

    ( )

    Prato and Bekhor 25

    where m are the nesting coefficients (with 0 m 1) and is aparameter to be estimated.

    CNL is a particular case of the GNL model, in which all the links

    share a common nesting coefficient m to be estimated. LNL is a

    particular case of the CNL model, in which the common nesting

    coefficient m approaches zero and is not estimated (22).

    Tables 3 and 4 illustrate the best estimates for route choice models

    considering both choice sets.

    The same interpretation of the results is possible for both choice sets.

    Parameter estimates suggest that choices are influenced by experience

    since habit and familiarity negatively influence the utility whereas

    landmarks positively affect drivers behavior. The same conclusions

    about the goodness of fit across models are reachable for both choice

    sets. The LNL model largely outperforms all other models, and PSL

    and C-Logit models also produce very good results. GNL and CNL

    models tend to collapse to MNL and present worse results than MNL.

    Statistical comparison across data sets is not available, for the

    impossibility of measuring the covariance across data sets that are

    correlated since they also include similar routes. The better likeli-

    hood ratio index obtained for the branch-and-bound choice set is

    justified by the lower number of alternatives in the branch-and-

    bound choice set, which leads to higher probabilities of choosing the

    observed choice and consequently to higher likelihood values.

    Since the choice sets include the same observations and the

    same conclusions are drawn in terms of result interpretation and

    model comparison, the analysis of the choice sets suggests that the

    m

    ml

    l C

    ml

    l C

    n

    n

    =

    1 12( )

    0

    5

    10

    15

    20

    25

    30

    35

    40

    45

    50

    55

    numberofuniqueroute

    s

    shortest pathbased methods branch-&-bound algorithm

    0 20 40 60 80

    cumulative percentage of the observations

    100

    FIGURE 3 Distribution of number of alternatives in choice sets.

  • 7/27/2019 370_2617318598.pdf

    8/10

    TABLE3

    ModelEstimationwithMergedChoiceSet

    MNL

    C-Logit

    PSL(=

    9)

    GNL

    CNL

    LNL

    Variable

    Estimate

    t-Stat.

    Estimate

    t-Stat.

    Estimate

    t-Stat.

    Estimate

    t-Stat.

    Estimate

    t-Stat.

    Estimate

    t-Stat.

    Distance

    0.882

    3.58

    1.025

    4.36

    1.028

    4.41

    0.80

    3

    3.84

    1.215

    2.91

    0.929

    2.97

    Traveltime

    0.422

    6.29

    0.412

    6.44

    0.416

    6.46

    0.30

    5

    3.72

    0.572

    3.38

    0.234

    2.83

    Sabotinosquaredummy

    2.692

    6.10

    2.544

    5.85

    2.156

    5.17

    2.15

    4

    4.53

    3.837

    3.53

    2.357

    4.37

    Adrianosquaredummy

    1.113

    2.51

    1.113

    2.46

    1.148

    2.58

    0.81

    6

    2.35

    1.418

    1.99

    1.214

    2.12

    Sommeillerbridgedummy

    4.083

    9.06

    4.040

    8.69

    3.659

    8.47

    3.21

    1

    5.18

    5.640

    3.83

    3.684

    6.64

    Dantebridgedummy

    3.256

    4.97

    3.335

    5.06

    3.054

    4.88

    2.68

    9

    4.16

    4.509

    3.31

    3.279

    3.91

    Rivolisquaredummy

    1.061

    2.89

    1.449

    3.92

    1.214

    3.45

    0.84

    3

    2.72

    1.490

    2.32

    0.968

    1.92

    Berninisquaredummy

    0.648

    1.34

    0.379

    0.81

    0.545

    1.14

    0.52

    8

    1.44

    0.805

    1.07

    0.653

    0.90

    Acajasquaredummy

    0.527

    1.31

    0.195

    0.50

    0.309

    0.79

    0.27

    9

    0.92

    0.467

    0.77

    0.432

    0.76

    Habit

    0.482

    2.73

    0.514

    2.97

    0.530

    3.14

    0.26

    4

    2.33

    0.996

    1.88

    0.478

    2.94

    Spatialability

    0.225

    1.23

    0.239

    1.34

    0.241

    1.36

    0.07

    7

    0.90

    0.222

    0.61

    0.082

    0.50

    Familiarity

    0.154

    0.97

    0.163

    1.09

    0.199

    1.34

    0.17

    6

    2.75

    0.310

    0.93

    0.649

    4.46

    Commonalityfactor

    1.197

    4.45

    Lnofpathsize

    4.245

    5.89

    Commonnestingcoefficient

    0.986

    3.49

    Gammauniquenestingco

    efficient

    4.24

    5

    1.35

    Loglikelihoodatestimates

    592.30

    58

    3.46

    575.45

    597.19

    596.49

    442.58

    Likelihoodratioindex

    0.175

    0.188

    0.199

    0.169

    0.170

    0.384

    N=2

    23;loglikelihoodforall

    coefficientsatzerois718.35.

  • 7/27/2019 370_2617318598.pdf

    9/10

    TABLE4

    ModelEstimationwithBranch-and-BoundChoiceSet

    MNL

    C-Logit

    PSL(=

    9)

    GNL

    CNL

    LNL

    Variable

    Estimate

    t-Stat.

    Estimate

    t-Stat.

    Estimate

    t-Stat.

    Estimate

    t-Stat.

    Estimate

    t-Stat.

    Estimate

    t-Stat.

    Distance

    1.544

    6.29

    1.271

    5.13

    1.033

    4.04

    1.617

    5.52

    1.548

    4.24

    1

    .299

    4.46

    Traveltime

    0.355

    5.29

    0.335

    5.13

    0.332

    5.18

    0.302

    4.06

    0.347

    3.63

    0

    .160

    2.07

    Sabotinosquaredummy

    1.930

    4.33

    1.746

    3.99

    1.465

    3.34

    1.775

    3.66

    1.854

    3.23

    1

    .632

    3.17

    Adrianosquaredummy

    1.138

    2.56

    1.169

    2.69

    1.159

    2.71

    1.070

    2.30

    1.165

    2.16

    1

    .112

    1.82

    Sommeillerbridgedummy

    3.089

    8.00

    2.903

    7.47

    2.720

    7.08

    2.748

    5.76

    2.979

    4.20

    2

    .478

    4.54

    Dantebridgedummy

    1.846

    3.27

    1.936

    3.43

    1.903

    3.41

    1.630

    2.83

    1.592

    2.44

    2

    .049

    2.49

    Rivolisquaredummy

    0.345

    1.13

    0.745

    2.57

    0.555

    1.99

    0.141

    0.43

    0.103

    0.23

    0

    .338

    0.82

    Berninisquaredummy

    1.006

    2.02

    0.619

    1.31

    0.686

    1.48

    1.458

    2.59

    1.434

    2.05

    0

    .883

    1.34

    Acajasquaredummy

    0.605

    1.45

    0.250

    0.64

    0.288

    0.75

    0.830

    1.86

    0.811

    1.54

    0

    .500

    0.91

    Habit

    0.319

    2.34

    0.343

    2.53

    0.338

    2.51

    0.294

    2.12

    0.445

    1.76

    0

    .296

    2.10

    Spatialability

    0.270

    1.86

    0.272

    1.87

    0.297

    2.05

    0.219

    1.50

    0.147

    0.59

    0

    .212

    1.28

    Familiarity

    0.209

    1.76

    0.227

    1.95

    0.309

    2.61

    0.114

    0.94

    0.165

    0.77

    0

    .352

    2.88

    Commonalityfactor

    1.218

    2.88

    Lnofpathsize

    3.455

    4.33

    Commonnestingcoefficien

    t

    0.941

    4.63

    Gammauniquenestingco

    efficient

    0.091

    0.06

    Loglikelihoodatestimates

    466.27

    46

    2.28

    457.19

    470.79

    463.22

    364.89

    Likelihoodratioindex

    0.223

    0.230

    0.238

    0.215

    0.228

    0.392

    N=2

    23;loglikelihoodforallcoefficientsatzerois600.07.

  • 7/27/2019 370_2617318598.pdf

    10/10

    branch-and-bound algorithm performs better from the behavioral

    perspective by reproducing the actual chosen routes while including

    more heterogeneous alternatives in smaller choice sets.

    SUMMARY AND CONCLUSIONS

    A path enumeration algorithm based on the branch-and-bound tech-

    nique is proposed. This method is applicable to any urban networksince its implementation requires existing resources and computa-

    tional speed depends more on the tree depth than on the tree width.

    Implementation in a real case study shows the applicability of the

    method and evaluates the performance of the algorithm.

    Labeling, link elimination, link penalty, and simulation approaches

    and branch-and-bound techniques are applied to an urban network. A

    comparison with respect to actual routes chosen by individuals driv-

    ing habitually from home to work shows that the proposed algorithm

    is significantly better from the perspective of behavioral efficiency

    with respect to the ideal algorithm. In parallel, the designed technique

    shows a good trade-off between computational costs and efficiency

    with respect to the methods that demonstrate a small performance

    increase with increasing implementation time.

    Construction of different choice sets characterized by similar behav-ioral efficiencyone consisting of a path set resulting from the com-

    bination of methods relying on the shortest-path search and the other

    resulting from the application of the branch-and-bound algorithm

    enables evaluation of results and comparison of model estimation. The

    two choice sets produce estimates that are qualitatively comparable

    and suggest similar conclusions in terms of model comparison.

    Results from model estimation suggest the need for further inves-

    tigation into generalized nested structures, since GNL and CNL mod-

    els tend to collapse to the MNL model and the LNL model largely

    outperforms all other route choice models. Another area for fur-

    ther investigation is the route choice mechanism, since the influence

    of habit and landmarks on the utility suggests that distance and travel

    time are not the only elements considered in choosing a route.

    The parameters for defining the constraints in the bounding rulewere arbitrarily defined in this study on the basis of common sense.

    Further research is needed to verify the sensitivity of the branch-

    and-bound algorithm to the constraints parameters and to measure

    the effectiveness of the proposed method with respect to different

    data sets.

    ACKNOWLEDGMENTS

    The authors are grateful to the anonymous reviewers, who provided

    many insightful comments and corrections to improve this paper.

    REFERENCES

    1. Cascetta, E., F. Russo, and A. Vitetta. Stochastic User EquilibriumAssignment with Explicit Path Enumeration: Comparison of Modelsand Algorithms. Proc., 8th IFAC Symposium on Transportation Systems,Technical University of Crete, Chania, 1997, pp. 10781084.

    2. Prashker, J. N., and S. Bekhor. Route Choice Models used in theStochastic User Equilibrium Problem: A Review. Transport Reviews,Vol. 24, 2004, pp. 437463.

    3. Dijkstra, E. W. Note on Two Problems in Connection with Graphs.Numerical Mathematics, Vol. 1, 1959, pp. 269271.

    4. Van der Zijpp, N. J., and S. Fiorenzo-Catalano. Path Enumeration byFinding the Constrained K-Shortest Paths. Transportation Research,Vol. 39B, 2005, pp. 545563.

    5. Ben-Akiva, M., M. J. Bergman, A. J. Daly, and R. Ramaswamy. Mod-eling Inter-Urban Route Choice Behaviour. Proc., of the Ninth Inter-national Symposium on Transportation and Traffic Theory, VNU SciencePress, Utrecht, Netherlands, 1984, pp. 299330.

    6. Dial, R. B. Bicriterion Traffic Equilibrium: T2 Model, Algorithm, and

    Software Overview. In Transportation Research Record: Journal of theTransportation Research Board, No. 1725, Transportation ResearchBoard of the National Academies, Washington, D.C., 2000, pp. 5462.

    7. Azevedo, J. A., M. E. O. Santos Costa, J. J. E. R. Silvestre Madera, andE. Q. Vieira Martins. An Algorithm for the Ranking of Shortest Paths.

    European Journal of Operational Research,Vol. 69, 1993, pp. 97106.8. De la Barra, T., B. Perez, and J. Anez. Multidimensional Path Search

    and Assignment. Proc., 21st PTRC Summer Annual Meeting, PTRCEducation and Research Services Ltd., London, 1993.

    9. Park, D., and L. R. Rilett. Identifying Multiple and Reasonable Paths inTransportation Networks: A Heuristic Approach. In Transportation

    Research Record 1607, TRB, National Research Council, Washington,D.C., 1997, pp. 3137.

    10. Scott, K., G. Pabon-Jimenez, and D. Bernstein. Finding Alternatives tothe Best Path. Presented at 76th Annual Meeting of the TransportationResearch Board, Washington, D.C., 1997.

    11. Sheffi, Y., and W. B. Powell. An Algorithm for the Equilibrium

    Assignment Problem with Random Link Times. Networks , Vol. 12,1982, pp. 191207.

    12. Fiorenzo-Catalano, S., and N. J. Van der Zijpp. A Forecasting Model forInland Navigation Based on Route Enumeration. Proc., EuropeanTransport Conference, PTRC Education and Research Services Ltd.,London, 2001, pp. 111.

    13. Bekhor, S., M. E. Ben-Akiva, and S. Ramming. Route Choice: ChoiceSet Generation and Probabilistic Choice Models. Proc., 4th TRISTANConference, Azores, Portugal, 2001.

    14. Friedrich, M., I. Hofsss, and S. Wekeck. Timetable-Based TransitAssignment Using Branch and Bound Technique. In Transportation

    Research Record: Journal of the Transporta tion Research Board,No. 1752, TRB, National Research Council, Washington, D.C.,2001, pp. 100107.

    15. Hoogendoorn-Lanser, S.Modelling Travel Behaviour in Multi-modalNetworks. TRAIL Research School, Netherlands, 2005.

    16. Prato, C. G., S. Bekhor, and C. Pronello. Methodology for Exploratory

    Analysis of Latent Factors Influencing Drivers Behavior. In Transporta-tion Research Record: Journal of the Transportation Research Board,

    No. 1926, Transportation Research Board of the National Academies,Washington, D.C., 2005, pp. 115125.

    17. Prato, C. G.Latent Factors and Route Choice Behaviour. Ph.D. thesis.Turin Polytechnic, Italy, 2005.

    18. Cascetta, E., A. Nuzzolo, F. Russo, and A. Vitetta. A Modified LogitRoute Choice Model Overcoming Path Overlapping Problems: Speci-fication and Some Calibration Results for Interurban Networks. Proc.,Thirteenth International Symposium on Transportation and TrafficTheory, Pergamon, Lyon, France, 1996, pp. 697711.

    19. Ramming, S. Network Knowledge and Route Choice. Ph.D. thesis.Massachusetts Institute of Technology, Cambridge, Mass., 2001.

    20. Prashker, J. N., and S. Bekhor. Investigation of Stochastic NetworkLoading Procedures. In Transportation Research Record 1645, TRB,National Research Council, Washington, D.C., 1998, pp. 94102.

    21. Bekhor, S., and J. N. Prashker. Stochastic User Equilibrium Formulationfor Generalized Nested Logit Model. In Transportation Research

    Record: Journal of the Transportation Research Board, No. 1752,Trans-portation Research Board of the National Academies, Washington, D.C.,2001, pp. 8490.

    22. Vovsha, P., and S. Bekhor. Link-Nested Logit Model of Route Choice:Overcoming the Route Overlapping Problem. In Transportation Re-search Record 1645, TRB, National Research Council, Washington,D.C., 1998, pp. 133142.

    The Traveler Behavior and Values Committee sponsored publication of this paper.

    28 Transportation Research Record 1985