NZSA Visiting Lecturers

The New Zealand Statistical Association coordinates and provides some financial support for a tour of New Zealand universities by a distinguished overseas statistician. Normally the funding covers domestic travel within New Zealand, while host institutions cover local costs.

Usually this person, known as the NZSA Visiting Lecturer, will spend two to three days at each of the six main university centres, and give at least two lectures at each place: one for a general audience, and one more closely tied to his or her own particular research interests.

The 2001 NZSA Visiting Lecturer was Professor Richard Tweedie of the University of Minnesota, now sadly deceased.

NZSA Visiting Lecturers have been:
Prof Richard Tweedie2001
Prof C.R. Rao2005
Prof Ray Chambers2008
Prof Ingram Olkin2010

Richard Tweedie, NZSA Visiting Lecturer 2001

Professor Richard Tweedie of the University of Minnesota was the first New Zealand Statistical Association Visiting Lecturer. Sadly, he passed away shortly after his visit.

As NZSA Visiting Lecturer Professor Tweedie visited and presented lectures at the Victoria University of Wellington, Otago University, University of Canterbury, University of Auckland, and Massey University (at Albany). He also presented a keynote address at the Symposium to Honour Professor David Vere-Jones.

Professor Tweedie was Head of the Division of Biostatistics, in the School of Public Health at the University of Minnesota. His research interests are in the theory and application of Markov chains, and biostatistics including especially meta-analysis and stochastic modeling. He published over 130 scientific papers, an acclaimed book (Markov Chains and Stochastic Stability), and has very extensive experience as a statistical consultant. He was Editor of Statistical Science.

Timings of Professor Tweedie’s visit, contact people and details of his lectures are given below:

Wellington, April 19 to 22 (Peter Thomson) Meta-analysis – Potentials, Problems and Pitfalls 4 pm, April 20, Victoria University of Wellington

Dunedin, April 22 to 23 (John Kittelson)

Christchurch, April 23 to 25 (Jenny Brown)

Auckland, April 25 to 27 (David Scott) Perfect Simulation for MCMC and Markov Chains 2 pm, April 26, AT1

Massey University at Albany Meta-analysis – Potentials, Problems and Pitfalls 2 pm, April 27, PLT1, Maths-Physics Building, University of Auckland

C.R. Rao, NZSA Visiting Lecturer 2005

A brief preamble to C.R. Rao’s visit is given in Newsletter 60.

C.R. Rao has been designated as a Massey University Distinguished Visitor. He will be the Keynote speaker on the first day of IWMS 2005. The first session of the Workshop commencing at 9.am, Tuesday March 29 will be an open public lecture.

C.R. Rao will also present the WCAS Workshop on 22 March 2005.

Itinerary

Monday March 07: Arrives Auckland

Tuesday March 08 – Saturday March 12: Visit Prof. Srinivasan, University of Auckland, Business School.

Monday March 14: Visit University of Otago . Give seminar: “Cross Examination of Data”.

Tuesday March 15: Visit University of Canterbury. Give Seminar in afternoon. “Statistics: The science, technology and art of creating new knowledge”.

Wednesday March 16: Visit University of Canterbury.

Thursday March 17: Visit Victoria University of Wellington. Give seminar: “Cross Examination of Data.”

Friday March 18: Visit Massey University, Palmerston North. Give seminar: “Statistics: Reflections on the past and visions for the future.”

Tuesday March 22: One-day workshop at McMeekan Centre, Ruakura, Hamilton. “Data Scrutiny and Data Mining” 4 talks.

Wednesday March 23: Visit University of Auckland, Department of Statistics. Give seminar (noon): “Statistics: Reflections on the past and visions for the future”

Thursday March 24: Visit University of Auckland.

Tuesday March 29, 9:30 am: Public lecture and IWMS Keynote talk. “Statistical proofs of matrix theorems.”

Wednesday Mar 30 – Thursday March 31: IWMS, including a technical talk, “Anti eigen and singular values”

Friday April 01: Leave NZ.

Contacts

Auckland (Business) : Anath Srinivasan a.srinivasan@auckland.ac.nz
Otago : Richard Barker rbarker@maths.otago.ac.nz
Canterbury : Easaw Chacko E.Chacko@math.canterbury.ac.nz
Victoria : Estate Khmaladze Estate.Khmaladze@mcs.vuw.ac.nz
Massey : Ganes Ganesalingam s.ganesalingam@massey.ac.nz
Hamilton : Nye John nye@stats.waikato.ac.nz
Auckland (Stats) : Chris Wild c.wild@auckland.ac.nz
IWMS : Jeff Hunter j.hunter@massey.ac.nz

Talk Abstracts

Cross Examination of Data

Abstract: Data obtained from historical records, designed experiments and sample surveys are not usually in a form where routine statistical methods can be employed and inferences drawn. There may be recording errors and missing observations. The data may be faked and contaminated with irrelevant data. Usually the stochastic model generating the data, essential for data analysis, is not known. The actual procedure planned for the collection of data might not have been strictly followed. Inferential analysis of data without examining these issues might lead to wrong conclusions.

The first task of a statistician is what R.A. Fisher emphasized to cross examine the data (CED), which is to look for deficiencies in data of the type mentioned above. Some questions could be answered by questioning those who collected the data, but statisticians must have the appropriate tools to elicit the answers from the data itself. This process is described by Tukey as exploratory data analysis (EDA), and by Mahalanobis as scrutiny of data (SOD). To some extent such preliminary analysis of data is an art, but much of it could be codified.

Statistics: The science, technology and art of creating new knowledge

Abstract: Practice of statistics today extends to the whole gamut of natural and social sciences, engineering and technology, management and economic affairs, as well as arts and literature. Statistics is being applied virtually to every field to make new discoveries and breakthroughs.

There are different concepts of knowledge: true knowledge as conceived by philosophers, mathematical knowledge deduced from given axioms, scientific knowledge as embodied in scientific theories and empirical knowledge with a specified amount of uncertainty inferred from observed data. It is the last type of knowledge which enables us to take optimal decisions if an action is necessary.

Some examples of questions that have been resolved by statistics will be given. Who wrote the poem discovered in a library without any record of authorship, Shakespeare or a contemporary poet. Did Shakespeare have ghost writers? Is the expression of a gene the same in a normal person and a cancer patient? Are goods produced by a machine according to specification? Is the second born child more intelligent than the first?

Statistics: Reflections on the past and visions for the future

Abstract: Statistics is not a basic discipline like mathematics, physics, chemistry or biology each of which has a subject matter of its own on which new knowledge is built. Statistics is more a method of solving problems and creating new knowledge in other areas. Statistics is used in diverse fields such as scientific research, legal practice, medical diagnosis, economic development and optimum decision making at individual and institutional levels.

What is the future of statistics in the 21st century which is dominated by information technology encompassing the whole of communications, interaction with intelligent systems, massive data bases, and complex information processing networks? The current statistical methodology based on probabilistic models applied on small data sets appears to be inadequate to solve new problems arising in emerging areas of science, technology and policy making. Ad hoc methods are being put forward under the title Data Mining by computer scientists and engineers to meet the demands of customers. The talk will focus on a critical review of current methods of statistics and future developments based on large data sets and enormous computing power and efficient optimization techniques.

Statistical proofs of matrix theorems

Abstract: Matrix algebra is extensively used in the study of linear models, multivariate analysis and optimization problems. It is interesting to note that the matrix results needed to prove statistical propositions can themselves be deduced using some statistical results which can be derived without using matrix algebra. The results are based on Fisher information and its properties which can be established without using matrix results.

For further information concerning Professor Rao’s visit contact:

Jeffrey J Hunter, Professor of Statistics
Institute of Information and Mathematical Sciences
Massey University, Albany Campus
Private Bag 102 904, North Shore Mail Centre
Auckland, 1330, NEW ZEALAND

Phone: +64 9 414 0800 Ext 41037
Fax: +64 9 441 8178
Web page: http://www.massey.ac.nz/~jhunter/
email: j.hunter@massey.ac.nz

Ray Chambers, NZSA Visiting Lecturer 2008

Ray Chambers is Professor of Statistical Methodology at University of Wollongong and has extensive research interests in the design and analysis of sample surveys, official statistics methodology, robust methods for statistical inference and analysis of data with group structure.

Statistics New Zealand is hosting Ray Chambers’ visit to New Zealand, and the New Zealand Statistical Association, through its Visiting Lectureship, is enabling Ray to visit other New Zealand centres.

A brief preamble to Ray Chambers’ visit will be given in Newsletter 67.

Itinerary

Monday 24 March: Arrives Christchurch.

Tuesday 25 March: Statistics New Zealand Christchurch office.

Wednesday 26 March: University of Canterbury.

Thursday 27 March: University of Otago.

Thursday 27 March (evening): Otago Statistics Group.

Wednesday 2 – Tuesday 8 April: Statistics New Zealand Wellington and OS Research and Victoria University of Wellington.

Wednesday 2 April, 3 pm, Statistics New Zealand:
A. “Robust Prediction of Small Area Means and Distributions”
Thursday 3 April, 6 pm: Wellington Statistics Group.
E: “Measurement Error in Auxiliary Information”
Friday 4 April: Victoria University of Wellington.
I: “Maximum Likelihood under Informative Sampling”
Monday 7 April, 3 pm, Statistics New Zealand:
H. “Estimation of the Finite Population Distribution Function”

Wednesday 9 April: Massey University, Palmerston North.

Monday 14 April: University of Waikato.

Tuesday 15 April: University of Auckland.

Wednesday 16 April: Departs Auckland.

Contacts

Christchurch : Jennifer Brown
Dunedin : John Harraway
Wellington : Sharleen Forbes
Palmerston North : Steve Haslett
Hamilton : Murray Jorgensen
Auckland : Chris Triggs

Talk Abstracts

A. Robust Prediction of Small Area Means and Distributions

Small area estimation techniques typically rely on mixed models containing random area effects to characterise between area variability. In contrast, the M-quantile approach to small area estimation avoids conventional Gaussian assumptions and problems associated with specification of random effects and uses M-quantile regression models to characterise small area effects. In this talk I will describe a general framework for robust small area prediction that is based on representing a small area estimator as a functional of a predictor of the within area distribution of the target variable, and is applicable under either a mixed model approach or a M-quantile approach. The usefulness of this framework will be demonstrated through< both model-based as well as design-based simulation, with the latter based on two realistic survey data sets containing small area information. An application to predicting key percentiles of district level distributions of per-capita household consumption expenditure in Albania in 2002 will be described.

B. Small Area Estimation Via M-quantile Geographically Weighted Regression

Spatially correlated data arise in many situations. When these data are used for small area estimation, a popular approach is to characterise the small area effects using a Simultaneous Autoregressive Regression model. An alternative approach incorporates the spatial information via Geographically Weighted Regression (GWR). In this talk I will describe how the M-quantile approach to small area estimation can be extended to situations where GWR is preferable. An important spin-off from this approach is more efficient synthetic estimation for out of sample areas. The usefulness of this framework will be demonstrated through model-based as well as design-based simulation. An application to predicting average Acid Neutralizing Capacity at 8-digit Hydrologic Unit Code level in the Northeast states of the USA will also be described.

C. Small Area Estimation Under Transformation To Linearity

Small area estimation based on linear mixed models can be inefficient when the underlying relationships are non-linear. In this talk I will describe small area estimation techniques for variables that can be modelled linearly following a non-linear transformation. In particular, I will show how so-called model-based direct estimation can be used with data that are consistent with a linear mixed model in the logarithmic scale, provided estimation weights are derived using model calibration. Simulation results will be presented which show that this transformation-based estimator is both efficient and robust with respect to the distribution of the random effects in the linear mixed model. An application to business survey data will also be discussed.

D. Robust Mean Squared Error Estimation for Linear Predictors for Domains

A crucial aspect of small area estimation is estimation of the mean squared error of the resulting small area estimators. In this talk I will discuss robust mean squared error estimation for linear predictors of finite population domain means. The approach that will be taken represents an extension of the well known ‘sandwich’ type variance estimator used in population level sample survey inference, and appears to lead to a mean squared error estimator that is simpler to implement, and potentially more robust, than alternatives suggested in the small area literature. The usefulness of this approach will be demonstrated through both model-based as well as design-based simulation, with the latter based on two realistic survey data sets containing small area information.

E. Measurement Error in Auxiliary Information

Auxiliary information is information about the target population of a sample survey over and above that contained in the actual data obtained from the sampled population units. The availability of this type of information represents a key distinction between sample survey inference and more mainstream inference scenarios. In particular, modern methods of sampling inference (both model-assisted as well as model-based) depend on the availability of auxiliary information to improve efficiency in survey estimation. However, such information is not always of high quality, and typically contains errors. In this talk I focus on some survey-based situations where auxiliary information is crucial, but where this information is not precise. Estimation methods that allow for this imprecision will be described. In doing so I will not only address the types of inference of concern to sampling statisticians (e.g. prediction of population quantities), but also inference for parameters of statistical models for surveyed populations.

F. Maximum Likelihood With Auxiliary Information

In this talk I use a general framework for maximum likelihood estimation with complex survey data to develop methods for efficiently incorporating external population information into linear and logistic regression models fitted via sample survey data. In particular, saddlepoint and smearing methods will be used to derive highly accurate approximations to the score and information functions defined by the model parameters under random sampling and under case-control sampling when auxiliary data on population moments are available. Simulation-based results illustrating the resulting gains in efficiency will also be discussed.

G. Analysis of Probability-Linked Data

Over the last 25 years, advances in information technology have led to the creation of linked individual level databases containing vast amounts of information relevant to research in health, epidemiology, economics, demography, sociology and many other scientific areas. In many cases this linking is not perfect but can be modelled as the outcome of a stochastic process, with a non-zero probability that a unit record in the linked database is actually based on data drawn from distinct individuals. The impact of the resulting linkage errors on analysis of data extracted from such a source is only slowly being appreciated. In this talk I will describe a framework for statistical analysis of such probability-linked data. Applications to linear and logistic regression modelling of this type of data will be discussed.

H. Estimation of the Finite Population Distribution Function

Although most survey outputs consist of estimates of means and totals, there are important situations where the primary focus is estimation of the finite population distribution function, defined as the proportion of population units with values less than or equal to the argument of this function. In this talk I will describe design-based, model-assisted and model-based methods for predicting a finite population distribution function, focussing on a situation where the underlying regression relationship is non-linear. An application to estimation of the distribution of hourly pay rates will be discussed in some detail.

I. Maximum Likelihood under Informative Sampling

Loosely speaking, a sampling method is informative if the random variable corresponding to the outcome of the sampling process and the random variable corresponding to the response that is of interest are correlated in some way. An examples of informative sampling is size-biased sampling. In this talk I describe a general framework for likelihood-based inference with sample survey data, including data collected via informative sampling. Some simple examples will then be used to contrast maximum likelihood estimation within this framework with alternative likelihood-based approaches that have been suggested for data collected under informative sampling.

For further information concerning Professor Chambers’ visit contact:

Kim Cullen
Subject Matter Project Manager
Statistical Education and Research
Statistics New Zealand
+64 (04) 931 4886
kimberly.cullen@stats.govt.nz

Professor Ingram Olkin, NZSA Visiting Lecturer 2010


We are delighted that Professor Ingram Olkin (Stanford University) will be the NZSA Visiting Lecturer for 2010. His visit will be associated with our next conference and the joint International Conference on Statistical Methodologies and Related Topics celebrating the contribution of Chin-Diew Lai.

Dr Olkin is an icon in the world statistical community, having been active for over 60 years. He is a member of many professional societies, has received many honours and awards, has held and holds many editorial positions, and has delivered numerous invited addresses all over the world. Ingram has coauthored 7 books, edited 10 books, and contributed 220 journal papers. His joint paper with Albert Marshall “A multivariate exponential distribution” was cited in 610 articles – a testament to high calibre research.

Dr Olkin’s work is aimed at ensuring that educators select the proper statistical tools for measuring the outcomes of their programs and methods, and that their interpretation of the results is similarly rigorous. His research includes the development of powerful new statistical methods for combining results from independent studies that have analysed the same topic. Meta-analysis is assisting researchers to reconsider long-standing educational problems with a fresh critical eye.

Dr Olkin is a Guggenheim, Fulbright, and Lady Davis Fellow, with an honorary Doctorate from De Montfort University. He received his BS in mathematics at the City College of New York, his MA from Columbia University, and his PhD from the University of North Carolina. Dr Olkin’s research interests include analysis of social and behavioural models; multivariate statistical analysis; correlational and regression models in educational processes and meta-analysis.

Dr Olkin will be visiting NZ universities and giving seminars. For up-to-date information email g.jones@massey.ac.nz.

Auckland University 22 – 23 June
Seminar: Wed 23 June, 11:00 am – 12:00 pm
Meta-Analysis: History and statistical issues for combining the results of independent studies.
Waikato University 24 – 25 June
Seminar: Fri 25 June, 10:30 am – 11:30 am
Meta-Analysis: History and statistical issues for combining the results of independent studies.Seminar: Fri 25 June, 2.00 pm – 3.00 pm
Majorization: A unified approach to inequalities.
NZSA Conference 29 June – 1 July
Plenary Speaker: TBA
Life Distributions in reliability and survival analysis.

Abstract

Semiparametric families are families that have both a real parameter and a parameter that is itself a distribution. A number of semiparametric parametric families suitable for lifetime data in survival or reliability are introduced: scale, power, frailty (proportional hazards), age, moment, and others. Interesting results on stochastic orderings are obtained for these families. The coincidence of two families provides a characterization of the underlying distribution. Some of the characterization results provide a rationale for the use of certain families. In this talk we provide an overview of these semiparametric families, and present several characterizations.

This work is a joint effort with Albert W. Marshall.

About the Speaker:
Professor Ingram Olkin (Stanford University) is the NZSA Visiting Lecturer for 2010. His visit to New Zealand is sponsored by Statistics New Zealand and is associated with the International Conference on Statistical Methodologies and Related Topics, in conjunction with the NZSA 2010 Conference, 29 June – 1 July 2010, Massey University, Palmerston North.

WSG 2 July
Seminar: Fri 2 July, 5.30 pm – 7.00 pm
Meta-Analysis: History and statistical issues for combining the results of independent studies.
Ministry of Education 5 July
(Wellington) Seminar: Mon 5 July, 10.30 am – 12.00 pm
Meta-Analysis: History and statistical issues for combining the results of independent studies.
Statistics New Zealand 5 July
(Wellington) Seminar: Mon 5 July, 2.00 pm – 3.00 pm
Measures of heterogeneity, diversity and inequality.
Victoria University 6 July
Seminar: Tues 6 July, 12.00 pm – 1.00 pm
Life Distributions in reliability and survival analysis.
Canterbury University 7 – 8 July
Seminar: Wed 7 July, 3.10 pm – 4.00 pm
Probabilistic proofs of matrix inequalities.
Seminar: Thurs 8 July, 3.10 pm – 4.00 pm
Meta-Analysis: History and statistical issues for combining the results of independent studies.

Chin-Diew Lai
Roger Littlejohn
John Haywood