Phd Dissertation

Econometric Models in Transportation

Publication Date

June 14, 2015

Author(s)

Abstract

The three chapters in this dissertation study and apply econometric models to answer questions in transportation economics. Chapter 1 and 2 analyze the Berry, Levinsohn and Pakes (BLP) discrete choice model for combined micro- and macro-level data. Chapter 1 considers the concerns of choice set aggregation and estimating consistent standard errors within the BLP Model. These concerns are studied within the context of a vehicle choice application with interest in estimating household valuation of fuel efficiency. Chapter 2 studies the numerical properties of the maximum likelihood approach to estimating this BLP model. Chapter 3 applies a Poisson-Log Normal panel data model to study the effect of red light cameras on collision counts in Los Angeles. The camera program suffered from weaknesses in enforcement that dampened the effectiveness of the program over time. The model considered here controls for this dampening effect. Chapter 1 finds that choice set aggregation affects the point estimates obtained from the BLP model and causes standard errors to be too small. The use of inconsistent sequential standard errors also underestimates the magnitude of standard errors. These findings may partly explain the disparity across existing estimates from choice models on the value households place on vehicle fuel efficiency. Chapter 2 finds that the maximum likelihood estimation approach is able to find the global minimum regardless of choice of starting values, optimization routine used and tightness of convergence criteria. These findings highlight the benefits of estimating the BLP model on combined micro- and macro-level datasets using the maximum likelihood approach compared to using the nested fixed point approach and only macro level data where numerical stability is difficult to obtain. Chapter 3 finds that controlling for the dampening effect from poor enforcement, the Los Angeles Automated Red Light Camera program decreased red light running related collisions but increased right-angle and injury collisions, as well as collisions overall.

Phd Dissertation

Analysis of discrete data models with endogeneity, simultaneity, and missing outcomes

Publication Date

June 14, 2015

Author(s)

Abstract

This thesis is concerned with specifying and estimating multivariate models in discrete data settings. The models are applied to several empirical applications with an emphasis in banking and monetary history. The approaches presented here are of central importance in model evaluation, policy analysis, and prediction. The first chapter develops a framework for estimating multivariate treatment effect models in the presence of sample selection. The methodology deals with several important issues prevalent in program evaluation, including non-random treatment assignment, endogeneity, and discrete outcomes. The framework is applied to evaluate the effectiveness of bank recapitalization programs and their ability to resuscitate the financial system. This paper presents a novel bank-level data set and employs the new methodology to jointly model a bank’s decision to apply for assistance, the central bank’s decision to approve or decline the assistance, and the bank’s performance. The article offers practical estimation tools to unveil new answers to important regulatory and government intervention questions. The second chapter examines an important but often overlooked obstacle in multivariate discrete data models which is the proper specification of endogenous covariates. Endogeneity can be modeled as latent or observed, representing competing hypotheses about the outcomes of interest. This paper highlights the use of existing Bayesian model comparison techniques to understand the nature of endogeneity. Consideration of both observed and latent modeling approaches is emphasized in two empirical applications. The first application examines linkages for banking contagion and the second application evaluates the impact of education on socioeconomic outcomes. The third chapter, which is joint work with Professor Ivan Jeliazkov, studies the formulation of the likelihood function for simultaneous equation models for discrete data. The approach rests on casting the required distribution as the invariant distribution of a suitably defined Markov chain. The derivation resolves puzzling paradoxes highlighted in earlier work, shows that such models are theoretically coherent, and offers simple and intuitive linkages to the better understood analysis of continuous outcomes. The new methodology is employed in two applications involving simultaneous equation models of (i) female labor supply and family financial stability, and (ii) the interactions between health and wealth.

Phd Dissertation

Human activity recognition : a data-driven approach

Abstract

In this research, we propose a series of models designed to take advantage of the availability of data–both structured and unstructured–from a variety of sources ranging including passive data, questionnaires, and social media data to analyze underlying patterns and trends in travel and activity behavior, and to provide results that support enhancements both in transportation planning and the application of programming to support such efforts. First, we introduce a framework for automatically inferring the travel modes and trip purposes of human movement when tracked by a GPS device. We utilize a multiple changepoints algorithm to divide trajectories into segments using only speed data. Then, Random Forest is used to classify segments into moving and not-moving types. For moving segments, travel mode (car, bus, train, walk, and bike) is predicted. Next, multiple machine learning algorithms are employed, validated, and tested to identify the most suitable model for inferring trip purposes. The overall accuracy for prediction is over 80% on the testing set, both with and without data on socio-demographic variables. The model also predicts “shop” trips with an accuracy of 92.1%, while its accuracy for “go home” and “studying” trips reaches 100%. Additionally, we utilize the classification results in the first stage of research to compare households’ travel patterns from before a new light rail transit line began service to two periods of time after service began. Our results indicate that, although the average of activity duration varies significantly over days of week and waves, the random effect of these two factors on activity duration was minor; time of day contributed over one third of the total variance in the duration. Finally, the dissertation demonstrates uses of Twitter data as a potentially important data source to understand comments, criticisms, and responses about light rail in Los Angeles. The results of information flow analysis, sentiment estimation, topic modeling, and its application can be useful for exploring trends among commuters and how their sentiment changed according to the light rail line they used, the time of day, and the day of the week

Phd Dissertation

Exploratory Dynamic Models of Alternative Fuel Vehicle Adoption

Abstract

Identifying socioeconomic characteristics and vehicle characteristics, including a market share of a specific vehicle, influencing on a choice of a vehicle is important for forecasting demands for alternative fuel vehicles (AFVs). Over the time, how changes in these characteristics will affect on the demands is also important. And by connecting with supply, how changes in demands for AFVs will make an effect on the supplies becomes important. This paper forecasts market shares of AFVs in demands and supplies.

First, in a demand part, a dataset of National Household Travel Survey in 2009 is used to identify factors which influence on a choice of AFVs by logit models. And then by using coefficients from the logit models, a dynamic normative model is proposed to forecast demands for Toyota Prius, a sort of hybrid vehicles, with respect to changes in characteristics such as a gas price and a vehicle price. Because a dynamic normative model is a simulation model with unknown values of parameters, these values are randomly defined to track the changes in market shares of Prius based on an annual vehicle market share data.

Next, in a supply part, proportions of hydrogen fuel cell vehicles (HFCVs) with respect to the density of hydrogen refueling stations are estimated by logit models. And then by using these results, a competition model is proposed to forecast supplies for HFCVs. Forecasting supplies for HFCVs is based on demands which is forecasted from a dynamic normative model.

Last, it is found that supplies of HFCVs from the competition model exceed affordable numbers of themselves for the market, because the demands for HFCVs from a dynamic normative model don’t consider affordable numbers of HFCVs for the market. Therefore, to connect results from two models, feedback methods are used.

The results indicate that the market share of AFVs will exceed that of ICEs when: 1) a gasoline price is increased, 2) a vehicle price of AFVs is decreased, 3) the initial market share of AFVs is large, and 4) the density of refueling stations is increased.

Phd Dissertation

Integration of Weigh-In-Motion and Inductive Signature Data

Abstract

Transportation agencies tasked with forecasting freight movements, creating and evaluating policy to mitigate transportation impacts on infrastructure and air quality, and furnishing the data necessary for performance driven investment depend on quality, detailed, and ubiquitous vehicle data. Unfortunately, commercial vehicle data is either missing or expensive to obtain from current resources. To overcome the drawbacks of existing commercial vehicle data collection tools and leverage the already heavy investments into existing sensor systems, a novel approach of integrating two existing data collection devices to gather high resolution truck data – Weigh-in-motion (WIM) systems and advanced inductive loop detectors (ILD) is developed in this dissertation. Each source provides a unique data set that when combined produces a synergistic data source that is particularly useful for truck body class modeling. Modelling truck body class, rather than axle configuration, provides more detailed depictions of commodity and industry level truck movements. Since body class is closely linked to commodity carried, drive and duty cycle, and other operating characteristics, it is inherently useful for each of the above mentioned applications. In this work the physical integration including hardware and data collection procedures undertaken to develop a series of truck body class models is presented. Approximately 35,000 samples consisting of photo, WIM, and ILD signature data were collected and processed representing a significant achievement over previous ILD signature models which were limited to around 1,500 commercial vehicle records. Three families of models were developed, each depicting an increasing level of input data and output class resolution. The first uses WIM data to estimate body class volumes of five semi-trailer body types and individual predictions of two tractor body classes for vehicles with five axle tractor trailer configurations. The trailer model produces volume errors of less than 10% while the tractor model resulted in a correct classification rate (CCR) of 92.7%. The second model uses ILD signatures to predict 47 vehicle body classes using a multiple classifier system (MCS) approach coupled with the Synthetic Minority Oversampling Technique (SMOTE) for preprocessing the training data samples. Tests show the model achieved CCR higher than 70% for 34 of the body classes. The third and most complex model combines WIM and ILD signatures using to produce 63 body class designations, 52 with CCR greater than 70%. To highlight the contributions of this work, several applications using body class data derived from the third model are presented including a time of day analysis, average payload estimation, and gross vehicle weight distribution estimation.

Phd Dissertation

Macroscopic modeling and analysis of urban vehicular traffic

Publication Date

December 30, 2014

Author(s)

Abstract

A macroscopic relation between the network-level average flow-rate and density, which is known as the macroscopic fundamental diagram (MFD), has been shown to exist in urban networks in stationary states. In the literature, however, most existing studies have considered the MFD as a phenomenon of urban networks, and few have tried to derive it analytically from signal settings, route choice behaviors, or demand patterns. Furthermore, it is still not clear about the definition or existence of stationary traffic states in urban networks and their stability properties. This dissertation research aims to fill this gap.

I start to study the stationary traffic states in a signalized double-ring network. A kinematic wave approach is used to formulate the traffic dynamics, and periodic traffic patterns are found using simulations and defined as stationary states. Furthermore, traffic dynamics are aggregated at the link level using the link queue model, and a Poincare map approach is introduced to analytically define and solve possible stationary states. Further results show that a stationary state can be Lyapunov stable, asymptotically stable, or unstable. Moreover, MFDs are explicitly derived such that the network flow-rate is a function of the network density, signal settings, and route choice behaviors. Also the time for the network to be gridlocked is analytically derived.

Even with the link queue model, traffic dynamics are still difficult to solve due to the discrete control at signalized junctions. Therefore, efforts are also devoted to deriving invariant continuous approximate models for a signalized road link and analyzing their properties under different capacity constraints, traffic conditions, traffic flow fundamental diagrams, signal settings, and traffic flow models. Analytical and simulation results show that the derived invariant continuous approximate model can fully capture the capacity constraints at the signalized junction and is a good approximation to the discrete signal control under different traffic conditions and traffic flow fundamental diagrams. Further analysis shows that non-invariant continuous approximate models cannot be used in the link transmission model since they can yield no solution to the traffic statics problem under certain traffic conditions.

For a signalized grid network, simulations with the link queue model confirm that important insights obtained for double-ring networks indeed apply to more general networks.

Phd Dissertation

Incorporating Individual Activity Arrival and Duration Preferences within a Time-of-day Travel Disutility Formulation of the Household Activity Pattern Problem (HAPP)

Publication Date

September 4, 2014

Author(s)

Abstract

This dissertation provides modifications and extensions to the Household Activity Pattern Problem (HAPP) to help move existing formulations from a laboratory prototype toward a more useable activity-based demand modeling product. Previous research on HAPP has been based on a pickup and delivery problem with time window constraints (PDPTW), which does not lend itself easily to application that is compatible with an activity-based forecasting model. Meanwhile, other research on activity-based modeling lacks of the integration of household decisions regarding time-of-day arrival, activity duration and traffic congestion effects on travel. We borrow concepts from economic research and consider that each household member tries to obtain maximum utility by choosing arrival time of activities, choosing activity duration while minimizing travel times and travel costs throughout the course of the day.

working paper

Determinants of Air Cargo Traffic in California

Publication Date

August 14, 2014

Abstract

Studies on the economic impact of air cargo traffic have been gaining traction in recent years. The slowed growth of air cargo traffic at California’s airports, however, has raised more pressing questions amongst airport planners and policy makers regarding the determinants of air cargo traffic. Specifically, it would be useful to know howCalifornia’s air cargo traffic is affected by urban economic characteristics surrounding airports. Accordingly, this study estimates the socioeconomic determinants of air cargo traffic across cities in California. We construct a 7-year panel (2003-2009) using quarterly employment, wage, population, and traffic data for metro areas in the state. Our results reveal that the concentration of service and manufacturing employment impacts the volume of outbound air cargo. Total air cargo traffic is found to grow faster than population, while the corresponding domestic traffic grows less than proportionally to city size. Wages play a significant role in determining both total and domestic air cargo movement. We provide point estimates for the traffic diversion between cities, showing that 80 percent of air cargo traffic is diverted away from a small city located within 100 miles of a large one. Using socioeconomic and demographic forecasts prepared for California’s Department of Transportation, we also forecast metro-level total and domestic air cargo tonnage for the years 2010-2040. Our forecasts for this period indicate that California’s total (domestic) air cargo traffic will increase at an average rate of 5.9 percent (4.4 percent) per year.

Phd Dissertation

Essays on Air Cargo Cost Structures, Airport Traffic, and Airport Delays: Panel Data Analysis of the U.S. Airline Industry

Publication Date

August 14, 2014

Author(s)

Abstract

The present thesis is comprised of four essays that address important gaps in passenger- and cargo-airline research. Seminal studies in airline economics that rely on cross-section methods make critical homogeneity assumptions and preclude time-specific effects. The essays in this thesis use panel data, which allow for certain assumptions made by cross-sectional studies to be relaxed, while shedding light on the intertemporal features of air transport.

The first chapter investigates the cost structure of air cargo carriers by applying a total cost model used in passenger-airline studies. Using quarterly panel data (2003-2011) on the domestic operations and costs of FedEx Express and UPS Airlines, empirical results indicate that the air cargo industry exhibits increasing returns to traffic density and constant returns to scale. Accounting for carrier-specific differences in cost structure and network size, FedEx is found to be more cost efficient than UPS (a finding that is reversed when network size is not controlled). Individually, UPS exhibits substantial economies of density and constant returns to scale while FedEx’s cost structure is characterized by weak economies of density and constant returns to scale. Both carriers exhibit economies of size.

The next three chapters embody papers that use quarterly panel data of city-level air traffic, airline delay, and socioeconomic variables. Spanning 10 years (2003-2012), the panel structure of the data permits the use of fixed effects to control for city-specific heterogeneity.

The second chapter presents a paper prepared for the Airport Cooperative Research Program (ACRP). The study demonstrates the within-city traffic impacts of urban size, employment composition, and wages, providing new insights into the determinants of passenger and air cargo traffic. The essay also confirms that airport traffic is proportional to population, and that service-sector employment and higher wages induce passenger travel and goods movement. A city’s share of manufacturing employment, however, only impacts air cargo traffic. Passenger enplanements exhibit more sensitivity to the proportion of urban workers providing non-tradable services, compared to the share of workers in tradable service jobs.

The third chapter, co-authored with Andre Tok, examines the determinants of air cargo traffic in California. The study uses a shorter 7-year panel (2003-2009), and shows that service and manufacturing employment impact the volume of outbound air cargo. Total (domestic) air cargo traffic is found to grow faster than (proportionally to) population, while wages play a significant role in determining both total and domestic air cargo movement. Metro-level air cargo tonnage are also forecasted for the years 2010-2040, indicating that California’s total (domestic) air cargo traffic will increase at an average rate of 5.9 percent (4.4 percent) per year in that period.

The final chapter is co-authored with Volodymyr Bilotkach, and it provides the first evidence on the impact of airline delays on urban-sectoral employment. Controlling for unobserved city-specific differences, the empirical estimates of the effects of air traffic on total employment are comparable to previously reported measures. However, service-sector employment is found to be less sensitive to air traffic than other studies suggested. New evidence confirming that delays have a negative impact on employment is also provided, a finding that is robust to various model specifications.