Dear […] Thanks for raising problems which many of us are facing. I want to put some general observation (which may be indirectly associated with your problem) for user of statistics and then discuss your problem specifically. People associated with statistics can be divided in two groups- group of developers and users. Challenges for both are different. Developers of statistical methods work on assumption while users work on ground reality. At initial phase of development of statistics this gap was narrow but now it is widening. As user of statistics, our primary aim should be enrich concerned domain. One can do it in following steps (1) Check which type of abstract ideas and believes are prevailing in concerned domain. (2) Think how believes and abstract ideas (based on intuition) may be represented through data. Three things are important here (1) What characteristics (like caste, land, welfare) should used on what unit (household, community etc) (2) How these characteristics should be represented through data (3) What are dependent and independent characteristics (4) How independent characteristics are related- additively, interactively etc. This is very crucial step. Here it is pertinent to
mention that there may be more than one way to represent abstract idea (and characteristics). For example, welfare (a characteristics) of household may be represented in many ways through data. Similarly there may be different theories (set of independent variables) to explain the production. So basic model comes from expertise of domain. Statistical tool should be used to estimate (an test) the parameter of model so that comparison can be made. Statistician can also help in searching a better model by inclusion of more suitable characteristics or taking different function of characteristics. (3) Collect seemingly concerned data according to statistical methods (as far as possible) (4) Use statistical tool to explore, estimate and test parameter of model. (5) Revise initial model so that it may be supported through data in better way. It is ground reality that there may be limitation to use various statistical method. What you have to do is to show all your limitation in report. For example you are using secondary data and it is not random. In this case you should mention what are possible source of bias. See attach file “How to lie with statistics.pdf”. Problem of pure statistician is that generally socio economic data is not suitable of advance statistical tool. For applied statistician creativity is in using tools of one domain in others. For example life table method generally used by demographer but it may be used to understand dropout in education. Similarly hazard based model used in medical may be used in economics. Real challenge before pure statistician is to get sufficient expertise in different domain quickly and explore whatever data can say and publish it with its limitation. Generally without getting domain expertise (or collaboration of domain expert) we want to apply statistical tool in absolute. One of the reason that we are taught by experts (as tool developers not user) who never emphasize role of context. We are taught in terms of random variable. That is why we think statistical tools may be applied in absolute. Above mentioned steps are nearer to causal model which covers large proportion of human thinking. There may be other type of modeling (like used to explain queues and network). Steps used for such model will be different. As conclusion, I want to say search method and data as per need of problem in place of searching a problem and methods which is suitable for data. Do not exercise for changing your body to adjust with already created (some time second hand) shirt (data and method). Better to create a shirt which fit on your body. I know this philosophy will not suits to many applied statistician who are under pressure to create more research paper. Applied statistics in socio economic area is long way which starts from case study and participatory (qualitative surveys) to use of data of large scale quantitative survey for which data has been collected by others. Now I am coming to your questions. 1. I could not see your graphs, my browser is not opening it (may be due to security reason). Please send it as attachment. 2. I would like to revise my idea on interval and ratio scale. I could not understand why profit is not ratio scale? Whether ratio of two profits are not meaningful? Please see http://dogsbody.psych.mun.ca/VassarStats/webtext.html or http://faculty.vassar.edu/lowry/ch1pt1.html (if above is not working) 3. Without condition of normality, you can do a lot of things. For classical regression analysis, condition of normality is not required if you want to estimate parameters of model (with st error). Condition of normality is required if you want to test parameter. Even estimated values provide a lot of information to enrich the knowledge of domain. For testing you can suitably transform your dependent (and independent variable if needed) (as Mr. Madan said). 4. I would like to see reference requirement of normality for use of Pearson correlation (I am not posing question) With regards Nand Kishore
|
1 of 1 File(s)



