Phd Dissertation

Investment climate, female ownership and firm performance: a study of formal and informal firms across developing countries

Publication Date

May 31, 2012

Author(s)

Abstract

In this dissertation, I examine the associations between various firm and owner characteristics, and investment climate measures on different aspects of firm performance. I use unique and well-designed World Bank survey datasets on formal and informal firms to perform several empirical analyses to answer the primary questions of interest. This dissertation consists of three chapters. The first chapter explores and identifies the determinants of corporate R & D investment decisions across Indian firms, such as, property rights protection, related governmental corruptions (e.g., informal payments to government officials), firm ownership structure and access to finance. The second chapter examines the relationships between several investment climate measures and exit decisions of firms across Latin American countries, and find that weaknesses in investment climate has an adverse effect on firm dynamics and composition of surviving firms. The third chapter studies the effect of female ownership on various indicators of firm performance and willingness to register for informal (un-registered) firms across several developing countries. Overall, I find that the various measures of investment climate and institutional development, and female ownership have statistically significant and robust associations with various indicators of firm performance, investment decisions and survival. Several important interaction effects are also identified in these empirical analyses. The findings in this dissertation can help policy makers to understand the determinants and crucial drivers of firm performance across developing countries and within each country. This in turn may help them to design and implement policies that can positively impact firm performance (overall and for female-owned firms) and growth. Through this study, policy makers can also have a better understanding of the performance differences among male-owned and female-owned firms in the informal sector that may aid in formulating appropriate policy measures targeted towards female-owned firms and female entrepreneurs in the informal economy.

Phd Dissertation

Essays in urban and transportation economics

Publication Date

May 31, 2012

Author(s)

Abstract

This dissertation is comprised of three chapters. Chapter 1 and 2 investigate the interactions between land-use patterns and household travel and vehicle-choice patterns, both empirically and theoretically. Chapter 3 explores institutional aspects of the housing and the rental markets in Korea. Chapter 1, which is coauthored with David Brownstone, estimates the influence of residential density on vehicle usage and fuel consumption. The empirical model accounts for both residential self-selection effects and non-random missing data problems. While most previous studies focus on a specific region, this paper uses national samples from the 2001 National Household Travel Survey. The estimation results indicate that the joint effect of the contextual density measure (density in the context of its surrounding area) and residential density on vehicle usage is quantitatively larger than the sole effect of residential density. We also find that a lower neighborhood residential density induces consumer choices toward less fuel-efficient vehicles, which confirms the finding in Brownstone and Golob (2009). Motivated by a finding in Chapter 1 of this dissertation, Chapter 2 presents a modified monocentric city model, which incorporates the consumer’s optimal vehicle-type choice problem. Consumers are assumed to explicitly consider driving inconvenience in the choice of vehicle sizes, and the resulting commuting cost is a function of residential density. This vehicle-type choice problem is embedded in an otherwise standard monocentric city model. Comparative static analyses suggest that an increase in commuting cost per mile, especially from increased unit cost of driving inconvenience, may induce spatial expansion of the city. Part of comparative static analysis shows how the city’s vehicle fuel efficiency depends on the city characteristics such as population and agricultural rent. Chapter 3 explores a unique kind of rental contract to Korea, called chonsei. The tenant pays an upfront deposit, typically from 40% to 70% of the property value, to the landlord, and the landlord repays the deposit to the tenant upon contract termination. The main goal of this paper is to show why such a unique rental contract exists and has been popular in Korea. The model shows that chonsei is an ingenious market response in the era of “financial repression” in Korea (Renaud (1989)), allowing landlords to accumulate sufficient funds for housing investment without major reliance on a mortgage and providing renters with cheap rental housing. The model implies that the chonsei renter may save while the landlord and the owner-occupier put all their assets into housing and thus have no financial savings, the hypothesis that is empirically tested and confirmed

working paper

Shared-Taxi Operations with Electric Vehicles

Abstract

Electric Vehicles (EVs) are energy-efficient and often presented as a zero-emission transport mode to achieve longer-term decarbonization visions in the transport sector. The implementation of a sustainable transportation environment through EV utilization, however, requires the addressing of certain cost and environmental concerns, before its full potential can be realized. These include EVs’ limited driving range and issues related to battery charging. Taxis are visible and thus EV use in taxi service can bring attention in urban life to a commitment towards sustainability in the public’s opinion. For this reason, this study proposes an integrated approach incorporating EV operation and an appropriate shared-ride conceptual design for taxi service. Despite several obvious societal and environmental benefits, it is however true that EV use entails certain vehicle productivity loss due to the time lost in charging. As this could lead to a deterioration in system performance, and thus in demand as well, it is important to look at whether the expected performance loss from the passengers’ and systems’ standpoint can be offset with ingenuity in operational design. A combination of shared-taxi and EV fleet is proposed for this purpose, as it can be competitive in passenger travel and wait times with conventional non-EV taxis. Such systems are modeled and analyzed using simulation in this paper, under routing algorithms modified from previous research. More specifically, EV charging schemes for taxi service implementation were proposed and the effects of the limited driving range and battery charging details were examined from a system performance viewpoint. First, this study shows illustrative results on the impact of the EV taxi fleet’s vehicle charging on system performance. Then, real-time shared-taxi operation schemes are developed and applied to maximize the system efficiency with such a fleet. Some limitations and future research agenda have also been discussed.

Phd Dissertation

Essays on missing data models and MCMC estimation

Publication Date

May 31, 2012

Author(s)

Abstract

My dissertation is composed of four chapters that focus on missing data models, BLP contraction mappings, and Markov chain Monte Carlo estimation. The first chapter focuses on estimating sample selection models with two incidentally truncated outcomes and two corresponding selection mechanisms. The method of estimation is an extension of the Markov chain Monte Carlo (MCMC) sampling algorithm from Chib (2007) and Chib et al. (2009). Contrary to conventional data augmentation strategies for dealing with missing data, the proposed algorithm augments the posterior with only a small subset of the total missing data caused by sample selection. This results in improved convergence of the MCMC chain and decreased storage costs, while maintaining tractability in the sampling densities. The methods are applied to estimate the effects of residential density on vehicle miles traveled and vehicle holdings in California. The empirical results suggest that residential density has a small economic impact on vehicle usage and holdings. In addition, the results show that changes to vehicle holdings from increased residential density are more sensitive for less fuel-efficient vehicles than for fuel-efficient vehicles on average. The second chapter considers the estimation of a multivariate sample selection model with p pairs of selection and outcome variables. A unique feature of this model is that the variables can be discrete or continuous with any parametric distribution, allowing a large class of multivariate models to be accommodated. For example, the model may involve any combination of variables that are continuous, binary, ordered, or censored. Although the joint distribution can be difficult to specify, a multivariate Gaussian copula function is used to link the marginal distributions together and handle the multivariate dependence. The proposed estimation approach relies on the MCMC-based techniques from Lee (2010) and Pitt et al. (2006) and adapts the methods from the preceding authors to a missing data setting. An important aspect of the estimation algorithm, in the same spirit as the algorithm from the first chapter, is that it does not require simulation of the missing outcomes. This has been shown to improve the mixing of the Markov chain. The methods are applied to both simulated and real data. The third paper analyzes a discrete choice model where the observed outcome is not the exact alternative chosen by a decision maker but rather the broad group of alternatives in which the chosen alternative belongs to. This model is designed for situations where the choice behavior at a lower level is of interest but only higher level data are available (e.g. analyzing households’ choices for vehicles at the make-model-trim level but only choice data at the make-model level are observed). I show that the parameters in the proposed model are locally identified, but for certain configurations of the data, they are weakly identified. Methods to incorporate additional information into the problem are discussed, and both maximum likelihood and Bayesian estimation methods are explored. The last chapter proposes improvements to the contraction mappings used in the context of multinomial logit models. The contraction mapping algorithm proposed in Berry et al. (1995) is slow to converge and is a major burden to implement in applied work. While it is relatively quick to converge for a single run of the algorithm, it is computationally expensive when repeated evaluations are needed, particularly when the algorithm is embedded into maximum likelihood, generalized method of moments, or Bayesian Markov chain Monte Carlo estimation routines. To alleviate this problem, I explore four simple modifications of the contraction mapping to improve its rate of convergence. Importantly, the modifications can be incorporated into existing code with minimal effort. In a simulation study, I demonstrate that the new algorithms require significantly fewer iterations to converge to the unique vector of fixed points than the original specification. The best algorithm results in an 80-fold improvement.

Phd Dissertation

Methodology for Tour-based Truck Demand Modeling

Publication Date

February 23, 2012

Author(s)

Abstract

Freight truck movements exhibit extensive trip interaction between shippers, receivers, and carriers of goods, logistics constraints, and use of advanced information technology. Such characteristics cannot be accurately captured by the traditional four-step approach which has been widely used in state and regional government agencies under the assumption that trips are independent. In this dissertation, it is possible to develop tour-based models using with two main approaches, in order to properly capture the trip-chaining behavior of clean drayage truck movements at the San Pedro Bay Ports (SPBP): 1) disaggregate level tour-based model using the Sequential Selective Vehicle Routing Problem (SSVRP) providing a utility-maximizing decision-making optimization framework and 2) aggregate level tour-based model using Entropy Maximization Algorithm. Before discussing the two different tour-based models, the first step is to analyze GPS data for interpreting the drayage trucks’ characteristics and providing model inputs. The brief background of GPS data is as follows: In recent years the Clean Trucks Program (CTP) has been implemented at California’s San Pedro Bay Ports (SPBPs) of Long Beach and Los Angeles to help address major environmental issues associated with port operations. “Clean trucks” (meeting 2007 model year emission standards) that utilized public funds to replace older polluting drayage trucks were required to be fitted with GPS units for compliance monitoring. In late 2010, 94% of cargo moves at the SPBPs were reportedly made by clean trucks. The study reported in this dissertation is based on a year of such GPS data for a sample that in 2010 comprised 545 clean drayage trucks. With the background, an analytical framework is introduced for processing GPS data to both interpret the trip chaining (or tour behavior) of the clean drayage trucks, and to prepare sufficient tour data for clean truck modeling at the SPBPs. After analyzing the data using the toolkit, one of the significant findings on the clean drayage truck operations is that the tours could be classified under four types, three of which contain repetitive trip patterns in a tour while the fourth tends to show travel in circulatory patterns. This analysis amply demonstrated why current models cannot address drayage truck behavior and why tour-based modeling of the drayage trucks is needs to be developed with sufficient care towards the type of routes the trucks operation. Two other theoretical advances in the research are the development of tour-based models using an Entropy Maximization Algorithm and a Selective Vehicle Routing Problem (SVRP). For the aggregate level, the revised tour-based entropy maximization model upgrades the tour-based entropy maximization model by Wang and Holguín-Veras (2009) which mostly focuses on general commercial vehicles. After introducing new constraints regarding sequential stops to Traffic Analysis Cells (TACs), the clean drayage truck tour behavior can be addressed with complex tour patterns. The revised tour-based entropy maximization model with a Primal Dual Convex Optimization (PDCO) algorithm is seen to converge very quickly. At the disaggregate level, the SSVRP model provides a utility-maximizing decision-making optimization framework under spatial-temporal constraints to explain observed truck patterns as activity participation analogous to household activity patterns. This would be impossible without the ability of Inverse Sequential Selective Vehicle Routing Problem (InvSSVRP) to calibrate the objective coefficients and arrival time constraints such that observed patterns are optimal values. The nodes (or TACs) are sequence-expanded to allow multiple stops at each node and divided into two arrival states (from depot or not from depot) in SSVRP, which provides for much more realism in capturing the drayage truck behavior. To make better use of the two proposed models, the framework of each tour-based model estimation and forecasting process is illustrated. Lastly, several future topics of relevance to improving the tour-based models are discussed.

working paper

The Location Selection Problem for the Household Activity Pattern Problem

Publication Date

December 31, 2011

Abstract

In this paper, an integrated destination choice model based on routing and scheduling considerations of daily activities is proposed. Extending the Household Activity Pattern Problem (HAPP), the Location Selection Problem (LSP-HAPP) demonstrates how location choice is made as a simultaneous decision from interactions both with activities having predetermined locations and those with many candidate locations. A dynamic programming algorithm, developed for PDPTW, is adapted to handle a potentially sizable number of candidate locations. It is shown to be efficient for HAPP and LSP-HAPP applications. The algorithm is extended to keep arrival times as functions for mathematical programming formulations of activity-based travel models that often have time variables in the objective.

working paper

On Activity-based Network Design Problems

Abstract

This paper examines network design where OD demand is not known a priori, but is the subject of responses in household or user itinerary choices that depend on subject infrastructure improvements. Using simple examples, we show that falsely assuming that household itineraries are not elastic can result in a lack in understanding of certain phenomena; e.g., increasing traffic even without increasing economic activity due to relaxing of space-time prism constraints, or worsening of utility despite infrastructure investments in cases where household objectives may conflict. An activity-based network design problem is proposed using the location routing problem (LRP) as inspiration. The bilevel formulation includes an upper level network design and shortest path problem while the lower level includes a set of disaggregate household itinerary optimization problems, posed as household activity pattern problem (HAPP) (or in the case with location choice, as generalized HAPP) models. As a bilevel problem with an NP-hard lower level problem, there is no algorithm for solving the model exactly. Simple numerical examples show optimality gaps of as much as 5% for a decomposition heuristic algorithm derived from the LRP. A large numerical case study based on Southern California data and setting suggest that even if infrastructure investments do not result in major changes in itineraries the results provide much higher resolution information to a decision-maker. Whereas a conventional model would output the best set of links to invest given an assumed OD matrix, the proposed model can output the same best set of links, the same OD matrix, and a detailed temporal distribution of activity participation and travel, given a set of desired destinations and schedules.

working paper

Strategic Hydrogen Refueling Station Locations Analysis with Scheduling and Routing Considerations of Individual Vehicles

Abstract

Set Covering problems find the optimal provision of service locations while guaranteeing an acceptable level of accessibility for every demand points in a given area. Other than reliance on static,exogenously-imposed accessibility measures, these problems either exclude substantive infrastructure-vehicle interactions or only include fragmented infrastructure-vehicle interactions related to the routing considerations of households seeking refueling service as a requirement of performing routine, daily activities. Here, we address this problem by coupling a Location-Routing Problem (LRP) that uses the set covering model as a location strategy to the Household Activity Pattern Problem (HAPP) as the mixed integer scheduling and routing model that optimizes households’ participation in out-of-home activities. The problem addressed includes multiple decision makers: the public/private sector as the service provider, and the collection of individual households that make their own routing decisions to perform a given set of “out-of-home activities” together with a visit to one of the service locations. A solution method that does not necessarily require the full information of the coverage matrix is developed to reduce the number of HAPPs that needs to be solved. The performance of the algorithm, as well as comparison of the results to the set covering model, is presented. Although the application is focused on identifying the optimal locations of Hydrogen Fuel Cell Vehicle (HFCV) refueling stations, this proposed formulation can be used as a facility location strategy for any service activity that is generally toured with other activities.

research report

Deployment of a Tool for Measuring Freeway Safety Performance

Abstract

This project updated and deployed a freeway safety performance measurement tool, building upon a previous project that developed the core methodology. The tool evaluates the cumulative risk over time of an accident or a particular kind of accident. The probability is estimated using a model that takes as input only variables that are derived from common inductive loop detectors. The estimated models predict increased risk of any accident occurring, as well as a number of characteristics of those accidents. By using this safety performance measurement tool, Caltrans will be able to evaluate the safety impacts of roadway changes over time. Specifically, it is anticipated that new deployments of intelligent transportation systems elements can be evaluated for their safety impacts by comparing the net risk of different kinds of accidents before and after deployment. The model predictions are best used to evaluate the cumulative probability of accidents and accident characteristics over longer time horizons and extended stretches of roadway.

Phd Dissertation

Integrated Modeling of Air Quality and Health Impacts of a Freight Transportation Corridor

Abstract

Due to environmental concerns, transportation studies have extensively evaluated emission impacts associated with traffic operational strategies and transportation policies. However, the impact studies mainly relied on emission impacts found using demand forecasting models. Such planning models cannot capture individual vehicles. interactions (i.e., lane changes or stop-and-go movements) or detailed traffic operations such as with traffic signals. These limitations often lead to under-estimated emissions while evaluating several policies. Even though many studies utilized microscopic traffic models to better estimate emissions, the studies have not considered further steps such as air quality estimation and health impact studies. This research develops an integrated framework for evaluating air quality and health impacts of transportation corridors using a microscopic traffic model, a micro-scale emissions model, a non-steady state dispersion model, and a health impact model. The main advantage of this approach is to better estimate air quality and health impacts from vehicle interactions and detailed traffic management strategies. As a case study, we evaluate air quality and health impacts of several scenarios associated with major transportation corridors accessing the San Pedro Bay Ports (SPBP) complex, California. The study context consists of two 20 miles-long major freight freeway corridors and nearby arterials, as well as line-haul rail along the Alameda corridor and several rail yards associated with the SPBP complex. For the scenarios, we consider a clean truck program, cleaner locomotives, and modal shifts compared to the 2005 baseline. All scenarios performed with the integrated framework have provided larger improvements of air quality and health impacts associated with transportation corridors than conventional frameworks using transportation planning models. However, the difference in air quality and health impacts from modal shift scenarios between clean trucks and locomotives are minor. As exploratory research, pollution response surface models are developed. The main objective of the pollution response surface model is to avoid the high computational cost of the microscopic traffic model, which makes it difficult to estimate traffic for multiple days needed for evaluating emissions and health impacts over longer periods such a climate season. A conceptual framework for estimating pollution response surface models is proposed. Using a hypothetical network, response surfaces of NOX and PM are estimated.