Newsgroup sci.stat.math 12056

Articles

Subject: Re: probability question
From: jpl@stat.ohio-state.edu (John P Lawrence)
Date: 4 Dec 1996 18:00:13 -0500

In article <584k1f$b8j@news2.charm.net>,
T. Scott Thompson  wrote:
>Those of you who have been on this group for a long time will
>recognize this question as one that should be posted on sci.math.stat
>instead of sci.stat.math, but since the former no longer exists, here
>it is...
>
>Suppose that y is a random n x 1 vector distributed such that
>Pr(y'y=1)=1.  Let X = {x: Pr(x'y=0)>0 }.
>
>Question: Can X have strictly positive Lebesgue measure?  (If so,
>please give an example.)
No.  There are at most a countable number of y's on the unit sphere that can
have positive probability.  The measure of the x's associated with each
of these y's is 0.  So, the measure of X is 0 (countable additivity).
>A closely related question: Suppose that x and y are independent
>random vectors such that Pr(x'x=y'y=1)=1.  Must it be the case that
>Pr(x'y=0)=0?
No.  Think about defining x and y to equal particular points
on the unit sphere with probability 1.  You can choose them so
that x and y are orthogonal.
>This is needed to solve a real research problem, and is not homework!
>
>----------------------------------------------------------
>I don't speak for the U.S. Department of Justice.
>Any views expressed here are my own.
>----------------------------------------------------------
>T. Scott Thompson			thompson@charm.net
>U.S. Dept. of Justice			thompsts@usdoj.gov
>Antitrust Division			(202) 307-3726

Return to Top

Subject: TO CHAID or not To CHAID?
From: gulyas@unixg.ubc.ca (Gyula Gulyas)
Date: 4 Dec 1996 16:50:08 -0800

I would like to know the pros and cons on using ChAID (Chi-square 
Automatic Interaction Detection). I tend to think that logistic 
regression can do the same thing (in terms of the outcome..) and can also 
deal with continuous variables. Can ChAID deal with intercations and such 
for example? What are its assumptions?
Thanks for your responses in advance.
Gyula
PS: Would you forward a copy of your response to ggulyas@hasimons.com, too?
-- 
Gyula Gulyas
Centre for Applied Conservation Biology
Faculty of Forestry, UBC
email: gulyas@unixg.ubc.ca   Tel:(604)822-4131   Fax:822-9102

Return to Top

Subject: Re: Water Purification
From: Peter Clark
Date: Wed, 04 Dec 1996 20:15:52 -0600

Edwin Edwards wrote:
> 
> I am requesting information on the use of hydrogen peroxide as disinfectant in
> water treatment.
> 
> edw0234@gecko.uvi.edu
??  H2O2 + H20 = H4O3  ??
I'd try a different newsgroup if I were you
-- 
______________________________________________
Peter Clark, Ph.D.,  New Orleans, USA
http://www.geocities.com/NapaValley/2509/

Return to Top

Subject: Re: multicollinearity
From: pfv2@cornell.edu (Paul Velleman)
Date: Wed, 04 Dec 1996 21:34:38 -0500

In article <32A4A3E6.634@ucdavis.edu>, jim bouldin 
wrote:
> I would like to "deal with" some potentially high levels of correlation
> between the independent variables in a multiple regression so that I may
> predict the effect of changing particular ind variables on the dependent
> variable.  
Your problem is clear from your first paragraph. I presume that you mean
that you want to find the effect on the dependent variable of chaning a
particular independent variable *while holding the other independent
variables constant*. If you have naturally collinear predictors, you cannot
find this effect because you probably can't change one of the collinear
variables and hold the others constant, so the question you are asking is
inappropriate for these data.
The method you mention has the effect of removing the effects of all other
predictors from one predictor of interest. What it leaves behind is not the
original predictor, but the part of that predictor not linearly predictable
from the other predictors. When you have collinearity, this part is likely
to be (and to *mean*) something quite different from the original variable.
The coefficient you estimate in a multiple regression actually refers (in a
sense) to this new variable; interpreting it in terms of the original
variable is wrong. In that sense, the method is fine. But it won't answer
the question you asked.
-- Paul Velleman

Return to Top

Subject: Re: multicollinearity
From: jim bouldin
Date: Wed, 04 Dec 1996 11:49:52 -0800

Thanks for your response.  Yes I do want to forecast, but I want to
forecast the effects of changes in one ind. variable with all others
held constant, like in an ANOVA.  Specifically I want to separate out
the effects of temperature and precipitation on tree growth rates
(experimental approaches are not possible).  Also I'm not sure how
increasing the sample size, already near 10K, will remove
multicollinearity.
Jim Bouldin
UC Davis

Return to Top

Subject: Re: Practical vs Educational
From: aacbrown@aol.com
Date: 5 Dec 1996 02:49:07 GMT

Antonio Black  in <32A4EABF.423A@sfsu.edu> asks:
> Is my end user going to want me to follow non essential
> procedures, put boxes around this and red ink that?
In my experience, most jobs require you to follow lots of non-essential
procedures. Getting the job done and getting the right answer are rewarded
less than following the rules (and following the rules is rewarded less
than playing the game). This is true in statistics and elsewhere.
However, you don't have to like or accept that. If you want to do
something useful with your life you have to resolve to ignore the
foolishness. You may make less money but at least you won't waste your
time.
The issue of grading your work is entirely different. Instructors have to
guess from what you write whether you know what you're doing. If you know
something you may consider it "non-essential" to write it down; but your
instructor may not know whether or not you know. Or, as you seem to
believe, your instructor may not know what is and is not essential. The
best plan in future may be to write everything down.
Aaron C. Brown
New York, NY

Return to Top

Subject: Re: Pronunciation of LaTeX
From: jgarcia@kinam.ens.uabc.mx (Jose J. Garcia-Ruvalcaba)
Date: 5 Dec 1996 03:41:20 GMT

In article <56bbsb$57s@hcunews.hiroshima-cu.ac.jp>, Hideo Hirose
(hirose@cs.hiroshima-cu.ac.jp) wrote:
>In Japan, many researchers pronounce LaTeX as "latef." Is it correct? How do you 
>pronounce TeX and LaTeX actually, especially in the united states?
>
>
>
In Mexico, we say something like
Lah-tej,
as the greek chi is somewhat similar, altough not equivalent,
to the spanish j.

Return to Top

Subject: Nonlinear Optimization with complex functions
From: Gerhard Heinzel
Date: Thu, 5 Dec 1996 11:06:36 +0100

I need to fit a complex function of a real variable to complex measured
data. To be specific, the data are electric transfer functions (measured
by a spectrum analyzer), with the frequency as independent variable and
the model is a relativly simple complex function of freeuncy typically
involving 4 to 10 unknown parameters, sometimes more. 
All algorithms I found so far assume real-valued data. Of course I could
define an SSQ or chi^2 as usual and apply a general minimization
algorithm, as e.g. Nelder-Mead Simplex. But I wonder if there are
specialized, more efficient algorithms or even implementations of e.g.
Levenberg-Marquardt? 
I would be grateful for any hints, 
Gerhard Heinzel
=====================================================================
  Gerhard Heinzel                          E-mail:   ghh@mpq.mpg.de
  Max-Planck-Institut fuer Quantenoptik
  Hans-Kopfermann-Str. 1                    Phone: +49(89)32905-268
  D-85748 Garching                                             -252
  Germany                                     Fax: +49(89)32905-200
=====================================================================

Return to Top

Subject: Statistical test based on KL-divergence available?
From: jan@atlas.informatik.uni-bonn.de (Jan Puzicha)
Date: 05 Dec 1996 10:38:44 GMT

I have been looking for statistical tests on equality of two empirical distributions. A lot of tests like Kolmogorov-Smirnoff, Cramer-von Mises and Chi-Square have been proposed and the distribution of the corresponding statistic has been analysed.
But I wonder, why I have not found any test based on the KL-divergence, which is known to be kind of a natural distance between distributions. Especially, as one has to take care of degenerate states, a natural statistic would be the following symmetrized empirical KL-statistic:
let p(i) and q(i) be two independend empirical distributions for some set of states or bins {i}, let s(i) = (p(i)+q(i))/2. 
	KL = sum_i [p(i) log(p(i)/s(i)) + q(i) log(q(i)/s(i))]
Questions:
1.) What is the distribution of KL under the assumption, that p and q are independent samples from the same underlying distribution.
2.) Does this yields a reliable statistical test. (some experiments done in a rather different context do indicate this).
3.) Is there any reference concerned with this topic?
Thanks a lot in advance
Jan
--------------------------------------------------------------------
Jan Puzicha                  | email: jan@uran.cs.uni-bonn.de
Institute f. Informatics III |        jan@cs.uni-bonn.de
University of Bonn           | WWW  : http://www.cs.uni-bonn.de/~jan 
                             |
Roemerstrasse 164            | Tel. : +49 228 73-4102  
D-53117 Bonn                 | Fax  : +49 228 73-4382

Return to Top

Subject: [Q] Probability
From: suthan@cee.hw.ac.uk (Manickam Umasuthan)
Date: Thu, 5 Dec 1996 13:44:58 GMT

I would like to know whether it is an easier way to solve the following
probability problem.
A box contains N balls of which N1 are red and N2 are black.
Samples (sample size p) are drawn from the box without replacement.
What is the probability of the probability that a sample would
contain at least one black? Also What is the probability a sample
would contain either all reds or all blacks?
thanks in advance 
suthan@cee.hw.ac.uk
-- 
+++

Return to Top

Subject: Re: [Q] Probability
From: "Robert E Sawyer"
Date: 5 Dec 1996 15:55:37 GMT

Homework hint:  Hypergeometric distribution.
-- 
Robert E Sawyer 
soen@pacbell.net
_________________
Manickam Umasuthan  wrote in article ...
| 
| I would like to know whether it is an easier way to solve the following
| probability problem.
| 
| A box contains N balls of which N1 are red and N2 are black.
| Samples (sample size p) are drawn from the box without replacement.
| What is the probability of the probability that a sample would
| contain at least one black? Also What is the probability a sample
| would contain either all reds or all blacks?
| 
| thanks in advance 
| suthan@cee.hw.ac.uk
| -- 
| +++
|

Return to Top

Subject: pragmatic rules for fit in GLM
From: Anouar Benmalek
Date: Thu, 05 Dec 1996 17:52:39 -0800

Good morning (or evening)!
	What are the "pragmatic rules" in assessing the fit
in generalised linear models, beside the plots of the residuals
and the asymptotic laws of the deviance (scaled deviance for S.A.S) and 
the Pearson X2?
			Thank
		Anouar Benmalek

Return to Top

Subject: Stat Librarites in C
From: bi495@FreeNet.Carleton.CA (Salaam Yitbarek)
Date: 5 Dec 1996 15:30:46 GMT

Hi,
  I'm looking for extensive statistical libraries written in C
or C++.  The only one I know about is the NAG C Library.  Any
other suggestions?
Thanks,
Salaam Yitbarek

Return to Top

Downloaded by WWW Programs
Byron Palmer

Newsgroup sci.stat.math 12056

Directory

Articles