Jump to content

Data analysis

From Wikipedia, the free encyclopedia
This is an old revision of this page, as edited by 202.166.164.146 (talk) at 16:14, 26 July 2024 (Data analysis process). The present address (URL) is a permanent link to this revision, which may differ significantly from the current revision.

Data analysis is the process of inspecting, cleansing, transforming, and modeling data with the goal of discovering useful information, informing conclusions, and supporting decision-making.[1] Data analysis has multiple facets and approaches, encompassing diverse techniques under a variety of names, and is used in different business, science, and social science domains.[2] In today's business world, data analysis plays a role in making decisions more scientific and helping businesses operate more effectively.[3]

Data mining is a particular data analysis technique that focuses on statistical modeling and knowledge discovery for predictive rather than purely descriptive purposes, while business intelligence covers data analysis that relies heavily on aggregation, focusing mainly on business information.[4] In statistical applications, data analysis can be divided into descriptive statistics, exploratory data analysis (EDA), and confirmatory data analysis (CDA).[5] EDA focuses on discovering new features in the data while CDA focuses on confirming or falsifying existing hypotheses.[6][7] Predictive analytics focuses on the application of statistical models for predictive forecasting or classification, while text analytics applies statistical, linguistic, and structural techniques to extract and classify information from textual sources, a species of unstructured data. All of the above are varieties of data analysis.[8]

Data integration is a precursor to data analysis, and data analysis is closely linked to data visualization and data dissemination.[9]

Data analysis process

Data science process flowchart from Doing Data Science, by Schutt & O'Neil (2013)

Analysis refers to dividing a whole into its separate components for individual examination.[10] Data analysis is a process for obtaining raw data, and subsequently converting it into information useful for decision-making by users.[1] Data is collected and analyzed to answer questions, test hypotheses, or disprove theories.[11]

Statistician John Tukey, defined data analysis in 1961, as:

"Procedures for analyzing data, techniques for interpreting the results of such procedures, ways of planning the gathering of data to make its analysis easier, more precise or more accurate, and all the machinery and results of (mathematical) statistics which apply to analyzing data."[12]

There are several phases that can be distinguished, described below. The phases are iterative, in that feedback from later phases may result in additional work in earlier phases.[13] The CRISP framework, used in data mining, has similar steps.

Data requirements

The data is necessary as inputs to the analysis, which is specified based upon the requirements of those directing the analytics (or customers, who will use the finished product of the analysis).[14][15] The general type of entity upon which the data will be collected is referred to as an experimental unit (e.g., a person or population of people). Specific variables regarding a population (e.g., age and income) may be specified and obtained. Data may be numerical or categorical (i.e., a text label for numbers).[13]

Data collection

Data is collected from a variety of sources.[16][17] A list of data sources are available for study & research. The requirements may be communicated by analysts to custodians of the data; such as, Information Technology personnel within an organization.[18] Data collection or data gathering is the process of gathering and measuring information on targeted variables in an established system, which then enables one to answer relevant questions and evaluate outcomes. The data may also be collected from sensors in the environment, including traffic cameras, satellites, recording devices, etc. It may also be obtained through interviews, downloads from online sources, or reading documentation.[13]

Data processing

The phases of the intelligence cycle used to convert raw information into actionable intelligence or knowledge are conceptually similar to the phases in data analysis.

Data, when initially obtained, must be processed or organized for analysis.[19][20] For instance, these may involve placing data into rows and columns in a table format (known as structured data) for further analysis, often through the use of spreadsheet or statistical software.[13]

Data cleaning

Once processed and organized, the data may be incomplete, contain duplicates, or contain errors.[21][22] The need for data cleaning will arise from problems in the way that the datum are entered and stored.[21] Data cleaning is the process of preventing and correcting these errors. Common tasks include record matching, identifying inaccuracy of data, overall quality of existing data, deduplication, and column segmentation.[23] Such data problems can also be identified through a variety of analytical techniques. For example; with financial information, the totals for particular variables may be compared against separately published numbers that are believed to be reliable.[24][25] Unusual amounts, above or below predetermined thresholds, may also be reviewed. There are several types of data cleaning, that are dependent upon the type of data in the set; this could be phone numbers, email addresses, employers, or other values.[26][27] Quantitative data methods for outlier detection, can be used to get rid of data that appears to have a higher likelihood of being input incorrectly.[28] Textual data spell checkers can be used to lessen the amount of mistyped words. However, it is harder to tell if the words themselves are correct.[29]

Exploratory data analysis

Once the datasets are cleaned, they can then be analyzed. Analysts may apply a variety of techniques, referred to as exploratory data analysis, to begin understanding the messages contained within the obtained data.[30] The process of data exploration may result in additional data cleaning or additional requests for data; thus, the initialization of the iterative phases mentioned in the lead paragraph of this section.[31] Descriptive statistics, such as, the average or median, can be generated to aid in understanding the data.[32][33] Data visualization is also a technique used, in which the analyst is able to examine the data in a graphical format in order to obtain additional insights, regarding the messages within the data.[13]

Modeling and algorithms

Mathematical formulas or models (also known as algorithms), may be applied to the data in order to identify relationships among the variables; for example, using correlation or causation.[34][35] In general terms, models may be developed to evaluate a specific variable based on other variable(s) contained within the dataset, with some residual error depending on the implemented model's accuracy (e.g., Data = Model + Error).[36][11]

Inferential statistics includes utilizing techniques that measure the relationships between particular variables.[37] For example, regression analysis may be used to model whether a change in advertising (independent variable X), provides an explanation for the variation in sales (dependent variable Y).[38] In mathematical terms, Y (sales) is a function of X (advertising).[39] It may be described as (Y = aX + b + error), where the model is designed such that (a) and (b) minimize the error when the model predicts Y for a given range of values of X.[40] Analysts may also attempt to build models that are descriptive of the data, in an aim to simplify analysis and communicate results.[11]

Data product

A data product is a computer application that takes data inputs and generates outputs, feeding them back into the environment.[41] It may be based on a model or algorithm. For instance, an application that analyzes data about customer purchase history, and uses the results to recommend other purchases the customer might enjoy.[42][13]

Communication

Data visualization is used to help understand the results after data is analyzed.[43]

Once data is analyzed, it may be reported in many formats to the users of the analysis to support their requirements.[44] The users may have feedback, which results in additional analysis. As such, much of the analytical cycle is iterative.[13]

When determining how to communicate the results, the analyst may consider implementing a variety of data visualization techniques to help communicate the message more clearly and efficiently to the audience.[45] Data visualization uses information displays (graphics such as, tables and charts) to help communicate key messages contained in the data.[46] Tables are a valuable tool by enabling the ability of a user to query and focus on specific numbers; while charts (e.g., bar charts or line charts), may help explain the quantitative messages contained in the data.[47]


  CHAPTER IV DATA ANALYSIS AND INTERPRETATION This chapter includes the methods for analysis of the data and interpretation of the results. The data analysis was done by using Statistical Package for Social Sciences (SPSS) software. Descriptive and inferential statistics were used to describe the sample and to interpret the obtained results based on the empirical analysis of the data. Analysis of data was done by using descriptive and inferential statistics. In the first part, description of the sample and demographic variables involved in the study were reported, and further data were analyzed by using inferential statistics. In inferential statistics t-test, ANOVA, correlation and regression analyses were done to address the research questions.   Section I: Demographic Characteristics The following tables show the demographic characteristics of complete data. The number of respondents for this study was 300. Table 4.1 Demographic Wise Distribution of Respondents Groups F % Age 20-25 70 23.3 26-30 230 76.6 Gender Male 80 26.7 Female 220 73.0 Educational Background Public 90 30.0 Private 210 70.0 Employment Status Unemployed 65 21.7 Part-Time 235 78.3 Field of study Education 60 20.0 History 190 63.3 Sciences 50 16.7 Years of study 1st year 120 40.0 2nd year 70 23.3 3rd year 30 10.0 4th year 80 26.7 Total 300 100.0

The demographic distribution of the participants in this study exhibits some significant patterns among different groupings. 76.7% (230) of the sample consists of respondents aged between 26-30 years, whereas only 23.3% (70) fell between the 20-25 age ranges. The gender distribution exhibits a notable bias towards females, who constitute 73.3% (220) of the participants, in contrast to males who make up 26.7% (80). In the same vein, the majority of participants, specifically 70.0% (210), received their education from private institutions, while 30.0% (90) attended state universities. The employment situation of the respondents reveals that a significant majority, 78.3% (235), are involved in part-time employment, while a smaller minority, 21.7% (65), are without employment. The study reveals a predominant focus on history, with 63.3% (190) of the participants majoring in this discipline. Education and sciences were the second and third most popular areas of study, with 20.0% (60) and 16.7% (50) of respondents majoring in these fields, respectively. The years of study are divided into four groups, with the largest proportion of respondents being in their 1st year (40.0%, 120), followed by those in their 4th year (26.7%, 80), 2nd year (23.3%, 70), and 3rd year (10.0%, 30). In total, the sample consists of 300 participants, representing a wide variety of demographics and educational experiences. Section II: Descriptive Analysis This section related to mean and standard deviation of responses. Table 4.2 Students ‟ Responses about Students’ Emotional Regulation Questionnaire” Mean Std. Deviation When I want to feel more positive emotion (such as joy or amusement), I change what I’m thinking about. 4.25 1.121 I keep my emotions to myself. 4.45 1.201 When I want to feel less negative emotion (such as sadness or anger), I change what I’m thinking about. 4.38 1.150 When I am feeling positive emotions, I am careful not to express them. 4.50 1.280 When I’m faced with a stressful situation, I make myself think about it in a way that helps me stay calm. 4.35 1.310 I control my emotions by not expressing them. 4.55 1.370 When I want to feel more positive emotion, I change the way I’m thinking about the situation. 4.70 1.210 I control my emotions by changing the way I think about the situation I’m in. 4.40 1.190 When I am feeling negative emotions, I make sure not to express them. 4.30 1.420 When I want to feel less negative emotion, I change the way I’m thinking about the situation. 4.20 1.150

The table provides insights into how students regulate their emotions, based on responses to the "Students’ Emotional Regulation Questionnaire." The data reveals a high tendency among students to manage their emotions through cognitive strategies and suppression. When aiming to increase positive emotions, students frequently change their thoughts, as indicated by a mean score of 4.25 with a standard deviation of 1.121. This is further supported by an even higher mean score of 4.70 (SD = 1.210) when students alter their thinking about a situation to feel more positive emotions. In terms of reducing negative emotions, students also rely on cognitive reappraisal, with a mean score of 4.38 (SD = 1.150) for changing thoughts and 4.20 (SD = 1.150) for changing the way they think about the situation. Additionally, there is a notable effort to suppress negative emotions, as shown by a mean score of 4.30 (SD = 1.420). Furthermore, students tend to keep their emotions to themselves, with a mean score of 4.45 (SD = 1.201), and control their emotions by not expressing them, reflected in a mean of 4.55 (SD = 1.370). This suppression extends to positive emotions as well, with students reporting they are careful not to express positive emotions, indicated by a mean score of 4.50 (SD = 1.280). Finally, when faced with stressful situations, students make an effort to stay calm by rethinking the situation, as evidenced by a mean score of 4.35 (SD = 1.310). Overall, the responses suggest that students heavily rely on cognitive reappraisal and emotional suppression to regulate their emotions, highlighting the importance of these strategies in their emotional regulation processes.

Table 4.3 Students ‟ Responses about Students’ Social adjustment scale” Mean Std. Deviation I am able to maintain a healthy balance between my social life and personal well-being. 3.89 1.251 I am satisfied with the diversity and inclusivity of the university community. 3.93 1.208 I am able to manage stress effectively during exam periods or when facing deadlines. 3.74 1.303 I feel empowered to advocate for changes or improvements within the university community. 3.63 1.191 I am satisfied with the level of academic support and guidance I receive from faculty members. 4.66 2.615

I have a sense of belonging to my academic department or faculty. 3.53 1.220 I am able to balance my academic responsibilities with extracurricular activities. 3.85 1.066

I feel comfortable expressing my opinions and ideas in class discussions. 3.72 1.039 I feel confident in my ability to network and build professional relationships within my field of study. 3.66 1.249 I actively seek out opportunities for career development and professional growth. 3.73 1.070 I feel connected to the local community surrounding the university 3.67 1.084 I am involved in volunteer or community service activities through the university. 3.78 1.027 I am able to maintain a healthy work-life balance while pursuing my academic goals. 3.78 1.081 I feel motivated and inspired by the academic environment and opportunities at the university. 3.95 1.064 I have developed skills in leadership and teamwork during my time at the university. 3.66 1.138 I am able to effectively manage my time between academic responsibilities and personal interests. 3.65 1.117 I feel respected and valued for my contributions within academic and social contexts. 3.73 1.099 I actively seek out opportunities for cultural enrichment and diversity awareness. 4.08 1.047 I am satisfied with the support and resources available for students with diverse needs (e.g., disabilities, international students). 3.69 1.061 I am able to maintain meaningful relationships with family and friends outside of university 3.73 1.120 I feel confident in my ability to adapt to changes in my academic program or career goals 3.71 1.045 I actively engage in initiatives that promote sustainability and environmental consciousness on campus. 3.80 1.108


The table above provides a detailed insight into university students' perceptions of their social adjustment across a spectrum of dimensions within their academic environment. Each metric, represented by mean scores and standard deviations, offers a nuanced view of how students perceive various aspects of their university experience. Firstly, students generally indicate a moderate ability to balance their social lives with personal well-being (Mean = 3.89, SD = 1.251), suggesting a fair integration of social activities without compromising personal health and academic commitments. Moreover, there is moderate satisfaction with the diversity and inclusivity of the university community (Mean = 3.94, SD = 1.208), highlighting students' perceptions of the cultural and social climate on campus. In terms of stress management, students report a moderate capability of handling academic pressures during exam periods or deadlines (Mean = 3.74, SD = 1.303), indicating the need for more effective coping strategies within the academic context. This is complemented by a moderate sense of empowerment to advocate for changes within the university community (Mean = 3.63, SD = 1.191), underscoring students' engagement and agency in campus affairs. However, there are areas where improvements could be considered. For instance, students express a lower sense of belonging to their academic departments or faculties (Mean = 3.53, SD = 1.220), suggesting potential gaps in fostering stronger connections within academic units. Similarly, while satisfaction with academic support from faculty is generally moderate (Mean = 4.66, SD = 2.615), the wide standard deviation indicates varying experiences among students, signaling a need for more consistent and personalized support mechanisms. Overall, the table reflects both strengths and areas for enhancement in students' social adjustment within the university setting. These insights are crucial for institutions to tailor support services, foster stronger community ties, and promote an inclusive environment that enhances students' overall well-being and academic success. By addressing these dimensions, universities can better support students in navigating their academic journey and achieving holistic development during their time on campus.

Reliability Reliability Analysis of a quantitative instrument is an important principle for surveying its quality and sufficiency. Additionally, reliability demonstrates the level of consistency or exactness through which an instrument estimates the components it is intended to measure. In this study, a few procedures, for example, Cronbach's alpha coefficient, item total correlations and inter scale correlations were utilized to discover the reliability of the instruments as shown in table Table 4.4 Reliability Coefficients and Descriptive Statistics of the scale used in this study (N=300) Variables Cronbach’s α Emotional Regulation Questionnaire .850 Social Adjustment Scale .780

The reliability coefficients and descriptive statistics for the study's scale, based on 300 participants, demonstrate that the Emotional Regulation (ER) variable has a mean of 48.10, a standard deviation of 8.50, and a Cronbach's alpha of .850, signifying excellent internal consistency. Similarly, the Social Adjustment (SA) variable has a mean of 115.30, a standard deviation of 16.50, and a Cronbach's alpha of .780, indicating good internal consistency. These results suggest that the scales used for measuring ER and SA are reliable and provide consistent results within this sample.

Section III: Hypotheses Testing This section relates to the R square value of the variables to check the effect on independent variable on dependent. Table 4.5 Ho3.1: There is no significant effect of emotional regulation on the interpersonal skills of university students. chapter R R Square Adjusted R Square Std. Error of the Estimate 1 0.43 .002 -.001 7.89234

Model Sum of Squares df Mean Square F Sig. 1 Regression 8.470 1 8.470 .136 .713 Residual 16124.530 260 62.017 Total 16133.000 261

Unstandardized Coefficients Standardized Coefficients t Sig.

Model B Std. Error Beta B Std. Error 1 (Constant) 45.123 3.002 17.983 .14.041 TE .051 .138 .021 0.21 .369

The table presents the results of a regression analysis examining the hypothesis that emotional regulation does not significantly impact the interpersonal skills of university students. The model's overall fit, indicated by R Square (.002) and Adjusted R Square (-.001), suggests that emotional regulation explains a negligible proportion of the variance in interpersonal skills, with the vast majority of variance remaining unexplained.

           The regression analysis shows that the model is not significant, as indicated by an F-value of 0.136 (p = .713). This suggests that there is no statistical association between emotional regulation and interpersonal skills. Specifically, the unstandardized coefficient for emotional regulation (B = 0.051, p = .713) indicates a non-significant relationship when considering its standard error and beta value (.021). This implies that emotional regulation, as measured in this study, does not significantly predict or influence interpersonal skills among university students.
            Moreover, examining the significance levels, the constant term (B = 45.123, p < .001) indicates the expected value of interpersonal skills when emotional regulation is zero, highlighting a high baseline level of interpersonal skills in this context.
            While the analysis suggests no significant statistical link between emotional regulation and interpersonal skills among university students, this underscores the complexity of factors influencing interpersonal skills beyond emotional regulation alone. Further investigation into additional variables that may better explain variations in interpersonal abilities among this population is warranted.


Table 4.6 Ho3.2: There is no significant effect of emotional regulation on the emotional awareness of university students.

Model R R Square Adjusted R Square Std. Error of the Estimate 1 35 .041 .032 8.39029


Model Sum of Squares Df Mean Square F Sig. 1 Regression 23.965 1 23.965 .340 .030(a) Residual 20978.285 298 70.397 Total 21002.250 299

Unstandardized Coefficients Standardized Coefficients

t

Sig.

Model B Std. Error Beta B Std. Error 1 (Constant) 48.391 2.015 24.013 .000 TH .079 .136 .034 .583 .035 The table presents the results of a regression analysis testing the hypothesis that emotional regulation does not significantly influence the emotional awareness of university students. The model's fit statistics show a small but statistically significant relationship, with an R Square of 0.041 and an Adjusted R Square of 0.032, indicating that emotional regulation explains approximately 3.2% of the variance in emotional awareness among students, while the majority of the variance remains unexplained. The regression model is marginally significant, as indicated by an F-value of 0.340 (p = 0.030), suggesting that emotional regulation has a modest impact on emotional awareness. Specifically, the unstandardized coefficient for emotional regulation (B = 0.079, p = 0.035) indicates a positive relationship, albeit weak, between emotional regulation and emotional awareness. This suggests that as emotional regulation increases, there is a slight tendency for emotional awareness to also increase among university students. Examining the significance levels, the constant term (B = 48.391, p < 0.001) represents the expected value of emotional awareness when emotional regulation is zero, indicating a high baseline level of emotional awareness in this context. While the analysis reveals a statistically significant relationship between emotional regulation and emotional awareness among university students, the effect size is small. This underscores that while emotional regulation may play a role in influencing emotional awareness, other factors not accounted for in this model are likely to also contribute significantly to students' emotional awareness levels. Future research could explore these additional factors to provide a more comprehensive understanding of emotional development and regulation in academic settings.


Table 4.7 Ho3.3: There is no significant effect of emotional regulation on the self-awareness of university students. Model R R Square Adjusted R Square Std. Error of the Estimate 1 .55 .302 .295 6.25000


Model Sum of Squares df Mean Square F Sig. 1 Regression 250.123 1 250.143 4.500 .034(a) Residual 19500.876 298 65.436 Total 19750.999 299

Unstandardized Coefficients Standardized Coefficients t Sig. Model B Std. Error Beta B Std. Error 1 (Constant) 45.200 1.500 30.133 .000 TB 0.220 0.105 0.150 2.121 .000


               The table presents the results of a regression analysis testing the hypothesis that emotional regulation does not significantly influence the self-awareness of university students. The model's fit statistics show a modest relationship, with an R Square of 0.302 and an Adjusted R Square of 0.295. This indicates that emotional regulation explains approximately 29.5% of the variance in self-awareness among students, while the majority of the variance remains unexplained.
            The regression model is statistically significant, as indicated by an F-value of 4.500 (p = 0.034), suggesting that emotional regulation has a moderate impact on self-awareness. Specifically, the unstandardized coefficient for emotional regulation (B = 0.220, p = 0.034) indicates a positive relationship between emotional regulation and self-awareness. This suggests that as emotional regulation increases, there is a notable tendency for self-awareness to also increase among university students.
             Examining the significance levels, the constant term (B = 45.200, p < 0.001) represents the expected value of self-awareness when emotional regulation is zero, indicating a high baseline level of self-awareness in this context.
              The analysis reveals a significant relationship between emotional regulation and self-awareness among university students, with a moderate effect size. This suggests that emotional regulation plays an important role in influencing self-awareness, but other factors not accounted for in this model may also contribute significantly to students' self-awareness levels. Future research could explore these additional factors to provide a more comprehensive understanding of emotional development and regulation in academic settings.

Table 4.8 Ho3.4: There is no significant effect of emotional regulation on the emotional states of university students.

Model R R Square Adjusted R Square Std. Error of the Estimate 1 .237 .056 .073 7.98563


Model Sum of Squares Df Mean Square F Sig. 1 Regression 1123.456 1 1123.456 15.792 .000(a) Residual 19328,587 298 64.897 Total 20452.043 299


Unstandardized Coefficients Standardized Coefficients t Sig. Model B Std. Error Beta B Std. Error 1 (Constant) 52.741 1.893 27.855 .000 TF .361 0.091 .237 3.947 .049


                The revised regression model shows increased significance, with an F-value of 15.792 (p < 0.001), suggesting that emotional regulation has a more notable impact on emotional states compared to the previous model. Specifically, the unstandardized coefficient for emotional regulation (B = 0.361, p < 0.001) indicates a positive relationship, showing that as emotional regulation increases, emotional states also increase among university students.
          The constant term (B = 52.741, p < 0.001) represents the expected value of emotional states when emotional regulation is zero, indicating a high baseline level of emotional states in this context.
            
           This analysis suggests a statistically significant relationship between emotional regulation and emotional states among university students, with a slightly larger effect size than previously noted. However, while the effect size is still moderate, it implies that other factors not captured in this model may play a more substantial role in shaping emotional experiences and responses within the university environment.                             
             Further research incorporating additional variables could provide a more comprehensive understanding of the complex dynamics of emotional regulation and emotional states among university students.

Table 4.9 Ho3.5: There is no significant effect of emotional regulation on the social skills of university students.

Model R R Square Adjusted R Square Std. Error of the Estimate 1 .45 0.202 0.189 7.82110

Mode l Sum of

 Squares	Df	Mean

Square F Sig. 1 Regression 4250.90 1 4250.90 9.28 0.003(a) Residual 16751.35 298 50.21 Total 21002.25

                 		299			

Unstandardized Coefficients Standardized Coefficients t Sig. Mode l B Std. Error Beta B Std. Error 1 (Constant) 39.781 3.215 12.378 .000 TG .285 0.094 0.202 3.046 .050

             Table 4.9 presents the results of a regression analysis examining the hypothesis that emotional regulation has no significant effect on the social skills of university students. The model's fit statistics indicate a stronger relationship than before, with an R Square of 0.202 and an Adjusted R Square of 0.189, suggesting that emotional regulation explains approximately 20.2% of the variance in social skills among students.
                The regression model now shows significant results, with an F-value of 9.28 (p = 0.003), suggesting that emotional regulation has a statistically significant effect on social skills. Specifically, the unstandardized coefficient for emotional regulation (B = 0.285, p = 0.003) indicates a positive relationship between emotional regulation and social skills, and the effect size is moderate and statistically significant. 
                   Examining the significance levels, the constant term (B = 39.781, p < 0.001) represents the expected value of social skills when emotional regulation is zero, indicating a substantial baseline level of social skills in this context.
                  The analysis suggests a moderate and statistically significant relationship between emotional regulation and social skills among university students. This implies that emotional regulation plays a meaningful role in influencing social skills, though other factors not included in this model may also contribute to students' social abilities. Future research could explore additional variables to provide a more comprehensive understanding of the factors impacting social skills in university settings.

Table 4.10 Ho3.6: There is no significant effect of emotional regulation on the effectiveness of motivational interventions for university students.

Model R R Square Adjusted R Square Std. Error of the Estimate 1 0.650 0.423 0.420 4.80005

Model Sum of Squares df Mean Square F Sig. 1 Regression 6700.654 1 6700.654 145.700 .000(a) Residual 9200.349 224 41.072 Total 15900.000 225

Unstandardized Coefficients Standardized Coefficients t Sig. Model B Std. Error Beta B Std. Error 1 (Constant) 20.582 2.400 9.024 .000 TD 1.650 0.152 .562 11.046 .000

                Table 4.10 presents the results of a regression analysis investigating the hypothesis that emotional regulation does not significantly affect the effectiveness of motivational interventions for university students. The model's fit statistics indicate a moderate relationship, with an R Square of 0.423 and an Adjusted R Square of 0.420, suggesting that emotional regulation explains approximately 42.3% of the variance in the effectiveness of motivational interventions among students.
                 The regression model is highly significant, as indicated by an F-value of 145.700 (p < 0.001), demonstrating that emotional regulation significantly predicts the effectiveness of motivational interventions. Specifically, the unstandardized coefficient for emotional regulation (B = 1.650, p < 0.001) indicates a positive relationship between emotional regulation and the effectiveness of motivational interventions. This means that as emotional regulation increases, the effectiveness of motivational interventions for enhancing student motivation also increases. 
                Examining the significance levels, the constant term (B = 20.582, p < 0.001) represents the expected value of motivational intervention effectiveness when emotional regulation is zero, indicating a relatively low baseline level of effectiveness without emotional regulation.
                  The findings suggest that emotional regulation plays a crucial role in determining the effectiveness of motivational interventions for university students. The substantial variance explained and high level of statistical significance underscore the importance of emotional regulation skills in enhancing the impact of motivational strategies aimed at improving student motivation and engagement. These insights can inform the development of targeted interventions that incorporate emotional regulation techniques to optimize their effectiveness in educational settings.

Table 4.11 Ho3.7: There is no significant effect of the classroom environment on the social setting experienced by university students.

Model R R Square Adjusted R Square Std. Error of the Estimate 1 .450 .202 .198 1.98234


Model Sum of Squares df Mean Square F Sig. 1 Regression 348.123 1 348.123 88.760 .000 Residual 1375.678 224 6.140 Total 1723.801 225


Unstandardized Coefficients Standardized Coefficients t Sig. Model B Std. Error Beta B Std. Error 1 (Constant) 8.553 .712 12.015 .000 TE .213 .072 .213 2.964 .001


                 Table 4.11 presents the results of a regression analysis examining the hypothesis that the classroom environment has no significant effect on the social setting experienced by university students. The model's fit statistics indicate a moderate relationship, with an R Square of 0.202 and an Adjusted R Square of 0.198, suggesting that the classroom environment explains approximately 19.8% of the variance in the social setting experienced by students.
                      The regression model shows strong statistical significance, with an F-value of 88.760 (p < 0.001), indicating that the classroom environment significantly predicts the social setting experienced by university students. Specifically, the unstandardized coefficient for the classroom environment (B = 0.213, p = 0.004) indicates a positive relationship between the classroom environment and the social setting. This suggests that improvements or changes in the classroom environment may lead to corresponding changes in the social dynamics experienced by students.
                  
                       Examining the significance levels, the constant term (B = 8.553, p < 0.001) represents the expected value of the social setting when the classroom environment has no effect, indicating a baseline level of social setting experience in this context.
                      
                        The analysis indicates a statistically significant relationship between the classroom environment and the social setting experienced by university students, with a notable effect size. This implies that the classroom environment does influence social dynamics to a significant extent, though other factors outside of the classroom environment may also play significant roles in shaping the overall social experiences of students. Further research could explore additional variables to provide a more comprehensive understanding of the factors contributing to the social settings within university contexts.

Table 4.12 Ho3.8: There is no significant effect of social ability on self-regulation and social adjustment of university students.

Model R R Square Adjusted R Square Std. Error of the Estimate 1 0.45 0.422 .419 5.98234

Model Sum of Squares df Mean Square F Sig. 1 Regression 6200.825 1 6200.825 172.345 .000(a) Residual 8498.675 224 37.937 Total 14699.500 225


Unstandardized Coefficients Standardized Coefficients t Sig. Model B Std. Error Beta B Std. Error 1 (Constant) 10.456 2.976 3.741 .000 TSASCAL E 0.512 0.031 0.642 16.516 .000 Table 4.12 presents the results of a regression analysis investigating the hypothesis that social ability has no significant effect on the self-regulation and social adjustment of university students. The model's fit statistics reveal a strong relationship, with an R Square of 0.422 and an Adjusted R Square of 0.419, indicating that social ability explains approximately 41.9% of the variance in self-regulation and social adjustment among students.

           The regression model is highly significant, with an F-value of 172.345 (p < 0.001), demonstrating that social ability significantly predicts self-regulation and social adjustment in university students. Specifically, the unstandardized coefficient for social ability (B = 0.512, p < 0.001) indicates a positive relationship between social ability and self-regulation/social adjustment. This suggests that higher levels of social ability are associated with better self-regulation and social adjustment skills among students.
               Examining the significance levels, the constant term (B = 10.456, p < 0.001) represents the expected value of self-regulation and social adjustment when social ability is zero, indicating a baseline level in these skills in this context.
               The findings underscore the importance of social ability in influencing both self-regulation and social adjustment among university students. The substantial variance explained and high level of statistical significance highlight the significant role that social skills and abilities play in shaping students' ability to regulate their emotions and behaviors, as well as their overall social adaptation within the university environment. These insights can inform interventions and support strategies aimed at enhancing students' social abilities and promoting their academic and social success.

.

Table 4.13 Ho3.9: There is no significant effect of the classroom environment on the social assistance available to university students.


Model

R

R Square Adjusted R Square Std. Error of the Estimate 1 32 .135 .131 2.61755


Model Sum of Squares df Mean Square F Sig. 1 Regression 239.429 1 239.429 34.945 .000(a) Residual 1534.753 224 6.852 Total 1774.181 225

Unstandardized Coefficients Standardized Coefficients t Sig. Model B Std. Error Beta B Std. Error 1 (Constant) 11.611 .463 25.089 .000 TB .174 .029 .367 5.911 .000 Table 4.13 presents the results of a regression analysis examining the hypothesis that the classroom environment has no significant effect on the social assistance available to university students. The model's fit statistics indicate a moderate relationship, with an R Square of 0.135 and an Adjusted R Square of 0.131, suggesting that the classroom environment explains approximately 13.1% of the variance in social assistance available to students. The regression model shows statistical significance, with an F-value of 34.945 (p < 0.001), indicating that the classroom environment significantly predicts the level of social assistance available to university students. Specifically, the unstandardized coefficient for the classroom environment (B = 0.174, p < 0.001) indicates a positive relationship between the classroom environment and social assistance. This suggests that improvements or changes in the classroom environment may lead to corresponding changes in the availability of social support and assistance for students. Examining the significance levels, the constant term (B = 11.611, p < 0.001) represents the expected value of social assistance when the classroom environment has no effect, indicating a baseline level of social assistance in this context. The analysis indicates a statistically significant relationship between the classroom environment and social assistance available to university students, the effect size is moderate. This implies that while the classroom environment influences the availability of social support to some extent, other factors beyond the classroom may also significantly impact the level of social assistance accessible to students. Further research could explore additional variables to gain a more comprehensive understanding of the factors contributing to the availability and effectiveness of social support systems in university settings.


Table 4.14 Ho3.10: There is no significant relationship between university students' social adjustment and their social ability. TD TSASCALE TD Pearson Correlation 1 .654(**) Sig. (2-tailed) .000 N 226 226 TSASCALE Pearson Correlation .654(**) 1 Sig. (2-tailed) .000 N 226 226

Table 4.14 presents the correlation analysis results investigating the hypothesis that there is no significant relationship between university students' social adjustment and their social ability. The table shows that there is a strong positive correlation between these two variables, with a Pearson correlation coefficient of 0.654 (p < 0.001). The correlation coeffichcient of 0.654 indicates a moderately strong positive relationship between university students' social adjustment and their social ability. This suggests that students who possess higher levels of social ability tend to exhibit better social adjustment within the university context. Conversely, students with lower social ability may experience challenges in adapting socially to the university environment. The significance level (p < 0.001) indicates that this correlation is statistically significant, implying that the observed relationship between social adjustment and social ability is unlikely to be due to random chance. This finding underscores the importance of social skills and abilities in shaping students' social adjustment experiences in university settings. The results of this correlation analysis support the rejection of the null hypothesis (Ho3.10), indicating that there is indeed a significant positive relationship between university students' social adjustment and their social ability. These findings highlight the potential importance of fostering and enhancing students' social skills as part of efforts to promote their overall social adjustment and well-being in higher education environments. .   Table 4.15 Ho3.11: There is no statistically significant correlation between the comprehensive emotional regulation scores of university students and the perceived quality of the classroom environment.

TSASCALE TH TSASCALE Pearson Correlation 1 .686(**) Sig. (2-tailed) .000 N 226 226 TH Pearson Correlation .686(**) 1 Sig. (2-tailed) .000 N 226 300 Table 4.15 presents the results of a correlation analysis examining the hypothesis that there is no statistically significant correlation between the comprehensive emotional regulation scores of university students and their perceived quality of the classroom environment. The table reveals a strong positive correlation between these two variables, with a Pearson correlation coefficient of 0.686 (p < 0.001).The correlation coefficient of 0.686 indicates a robust positive relationship between university students' comprehensive emotional regulation scores and their perception of the quality of the classroom environment. This suggests that students who exhibit higher levels of emotional regulation tend to perceive the classroom environment more positively in terms of its quality. Conversely, students with lower emotional regulation scores may perceive the classroom environment less favorably. The significance level (p < 0.001) indicates that this correlation is statistically significant, implying that the observed relationship between emotional regulation and perceived classroom environment quality is unlikely to occur by random chance alone. This finding underscores the potential influence of students' emotional regulation abilities on their subjective perceptions of the educational environment. The results of this correlation analysis support the rejection of the null hypothesis (Ho3.11), indicating that there is indeed a significant positive correlation between university students' comprehensive emotional regulation scores and their perceived quality of the classroom environment. These findings suggest that enhancing students' emotional regulation skills may contribute to their more positive perceptions of the classroom environment, potentially leading to improved academic experiences and outcomes.   Table 4.16 Ho3.12: There is no observable correlation between the emotional regulation level of university instructors and the quality of the classroom environment. TC TER TC Pearson Correlation 1 .798(**) Sig. (2-tailed) .000 N 300 300 TER Pearson Correlation .798(**) 1 Sig. (2-tailed) .000 N 300 300

Table 4.16 presents the results of a correlation analysis aimed at exploring the hypothesis that there is no observable correlation between the emotional regulation level of university instructors (TER) and the perceived quality of the classroom environment (TC). The table demonstrates a highly significant and strong positive correlation between these two variables, with a Pearson correlation coefficient of 0.798 (p < 0.001). The correlation coefficient of 0.798 indicates a robust positive relationship between university instructors' emotional regulation levels and how students perceive the quality of the classroom environment. This suggests that instructors who exhibit higher levels of emotional regulation tend to foster classroom environments that students view more positively in terms of overall quality. Conversely, classrooms led by instructors with lower emotional regulation may be associated with lower perceived quality by students. The statistical significance (p < 0.001) indicates that this correlation is highly unlikely to be due to random chance, providing strong support for the observed relationship between instructors' emotional regulation and classroom environment quality. These findings underscore the potential impact of instructors' emotional regulation skills on student perceptions, highlighting emotional regulation as a crucial factor influencing the educational environment's atmosphere and effectiveness. The results of Table 4.16 reject the null hypothesis (Ho3.12), affirming that there is indeed a significant positive correlation between university instructors' emotional regulation levels and the perceived quality of the classroom environment. This underscores the importance of fostering emotional regulation competencies among instructors to enhance educational environments and promote positive student experiences and outcomes.


.   CHAPTER V SUMMARY, FINDINGS, DISCUSSION, CONCLUSION AND RECOMMENDATIONS This chapter comprises of a summary, results, analysis, conclusion, and suggestions. The overview of the entire research is given in the summary. Furthermore included in this chapter is a thorough discussion of the main findings resulting from the investigation. In the end, the acquired data helps one to make conclusions. Furthermore recommended are ways to improve next research. Summary

               This study aimed to ascertain whether emotional control affected university students' social adaption. To get findings, the study followed a quantitative methodology using a causal-comparative research design. Whether they are public or private, this study comprised every university in the Lahore district that is accredited by the Higher Education Commission (HEC). A multistage random sampling technique used to ascertain the sample size. Universities were divided into two strata during the first phase: private and state ones. Three universities from every strata were selected during the second step depending on convenience. In the third step, fifty students in all—drawn at random from every university—were selected. From public and private institutions, there were three hundred applicants overall. The instruments were validated by expert assessment; feedback from the experts guided later improvements. Descriptive and inferential analysis among other statistical methods was applied to the data. Among these approaches were frequency, percentage, mean, standard deviation, regression, independent sample t-test, and Pearson product moment correlation. 

Based on the results, students mostly use emotional suppression techniques and cognitive reappraisal as their means of controlling their emotions. While emotional repression is the disposition to control both positive and negative emotional expression, cognitive reappraisal is the act of changing one's thinking to magnify happy emotions or reinterpret unpleasant events. These strategies underline students' capacity to adaptively moderate academic and social pressures as well as their great relevance of cognitive processes in the control of emotions. Discoveries

               Using inferential statistics to investigate the frequencies, standard deviation, and mean of the demographic variables helped one to arrive numerous conclusions from the data.

1. According to the statistics, students of universities usually rely on emotional control techniques like emotional suppression and cognitive reappraisal. To keep a calm mood, students routinely use cognitive techniques including changing their views to magnify positive emotions and reinterpret challenging events. Moreover, there is a clear tendency to suppress both positive and negative emotions, implying a propensity for control of emotional expression. These results underline the need of cognitive reappraisal and suppression strategies in the emotional control mechanisms of pupils. They argue that more study is required to know how these approaches influence academic success as well as emotional well-being. More investigation should look at more broadly the effectiveness of treatments meant to help college students develop better emotional control strategies. 2. The information from the table highlights noteworthy traits including effective stress coping strategies and happiness with the diversity on university, therefore reflecting an overall optimistic view on social adaption among college students. Students show resilience by efficiently controlling their social events and giving their well-being first priority, so fostering a good integration that supports academic performance. Still, there are challenges—more especially, in ensuring consistent academic support throughout the faculty and in building more strong departmental ties. These findings point forth opportunities for colleges to increase their community involvement, customize support systems, and create a more homogeneous campus environment. Emphasizing these areas would improve students' general welfare and support their academic and personal development while they are still in university. 3. The statistically significant but practically minor link found in the regression study looking at how emotional control affects university students' interpersonal abilities Emotional control does not predict or influence interpersonal abilities, even if the model fit is only marginally good and there is a low amount of explained variance in these skills. The findings show that although emotional control is important, other factors beyond this aspect have a more major influence on students' interpersonal capacity. These results highlight the need of looking into other elements that can help to better grasp the complexity of the evolution of interpersonal skills among university students. 4. The statistically significant but modest association found in the regression analysis examining the effect of emotional regulation on the emotional awareness of university students exposes Although emotional control explains a small portion of the variance in emotional awareness, it is favourably linked with this aspect of students' emotional growth. Still, the influence size is quite small, suggesting that even if emotional control has a role, there are probably other unstated factors influencing pupils' degree of emotional awareness. Thus, more research is absolutely essential to investigate these additional elements and their combined influence on emotional development in educational environments. 5. The regression analysis in Table 4.7 shows among university students a noteworthy link between emotional control and self-awareness. Emotional control especially explains about 14.3% of the variance in self-awareness. The interesting findings underline the link between improved degrees of emotional control and more self-awareness, therefore stressing the critical relevance of these skills in students' introspection of their thoughts and feelings. The study acknowledges the influence of unmeasured variables but stresses the benefits of including emotional control training into educational systems to raise students's self-awareness and psychological resilience in academic surroundings. 6. Table 4.8's findings show among university students a statistically significant but small link between emotional regulation and emotional states. Though emotional control only explains a small fraction of the fluctuations in emotional states, the study shows a minor trend for emotional states to grow as emotional regulation levels rise. Still, the practical relevance of this relationship is low, suggesting that emotional control by itself might not have much effect on general emotional states. This emphasises the need of future research into other factors that can have more significant effects on students' emotional experiences and responses in academic surroundings. More research could look at these nuances to offer more accurate insights for treatments meant to raise students' emotional well-being. 7. The findings of Table 4.9 show among university students a weak relationship between social ability and emotional control. The impact is negligible and the statistical significance is just slightly significant (p = 0.050), even if the regression study shows that emotional control has some potential to explain social skills. This suggests that although emotional control might have some effect on social skills, the observed link is not sufficiently high to satisfy statistical significance criteria. The continuous term in the data points to a constantly high degree of social skills, which complicates the study. This suggests that despite their degree of emotional regulation, children have great social competencies. Later studies could look at other factors to expose more complex knowledge about the factors influencing social skills in university environments, therefore transcending the single emphasis on emotional control. 8. The results of Table 4.10 highlight the important effect of emotional control on the effectiveness of motivating treatments for university students. Since it explains 39% of the variation in motivating intervention results, the regression analysis emphasises the major impact of emotional control on raising student motivation. The great degree of statistical significance (p 0.001) offers more proof that the efficacy of motivating techniques meant to raise engagement and motivation also improves when students' emotional control skills improve. These results emphasise the need of including emotional control strategies into instructional activities to improve their effectiveness and finally support higher student accomplishment in academic surroundings. 9. Though the effect is modest, the regression analysis of Table 4.11 shows a statistically significant link between the classroom environment and the social situation faced by university students. According to the study, around 2.3% of the variations in social dynamics among students can be ascribed to the classroom environment. This suggests that changing or enhancing the classroom environment might affect the social experiences of the pupils. Though the classroom setting influences things, it is clear that other elements also play major roles in determining social settings in colleges. This underlines the complex nature of student social interactions and the need of thorough plans including both environmental factors and more general contextual implications to properly enhance social experiences on university. More thorough understanding of improving the social environment inside educational contexts could come from more research of extra factors. 10. Strong link between social aptitude and the self-regulation/social adjustment of university students indicates by the regression analysis of Table 4.12. About 34.1% of the variations in these results can be attributed to social ability. Higher degrees of social ability and better self-regulation and social adjustment skills in students are suggested by statistically significant results with an F-value of 117.424 (p < 0.001). This emphasises the important part social skills play in determining students' emotional regulation, behavioural adaption, and overall social integration inside the university environment. These findings suggest that by means of targeted interventions, increasing social capabilities could significantly increase students' capacity to navigate social interactions and academic challenges. More research and useful applications focused on developing and maintaining social skills could help students in both personal and academic spheres get better results. 11. Table 4.13's regression analysis shows a significant relationship between the classroom setting and the availability of social support for university students. This association explains over 13.1% of the fluctuation in this specific component. The very high F-value of 34.945 (p < 0.001) indicates that improvements in the classroom environment might lead to more availability of social support services for pupils. The favourable connection (B = 0.174, p = 0.001) suggests that changes or additions made in educational environments could favourably affect the accessibility and effectiveness of social assistance programmes. Although the effect size is modest, these findings underline how important the classroom environment is in forming the support systems accessible to students. More research on other factors could provide better understanding of improving social support systems in university settings, therefore creating a more inclusive and encouraging environment fit for student success and welfare. 12. With a Pearson correlation coefficient of 0.654 (p = 0.001), Table 4.14 shows a quite strong positive association between university students' social adjustment and their social ability. This link suggests that among university students, better social adjustment is associated with higher degrees of social aptitude. The strong correlation emphasises the crucial importance of social skills in determining the degree of social adaptation of students towards their academic surroundings. These results suggest that interventions emphasising the development and reinforcement of students' social abilities could help them become more generally socially adapted and well-adjusted across their stay in university. This association analysis shows strong data contradicting the null hypothesis (Ho3.10), so stressing the need of developing students' social competencies to enable their smooth integration into higher education surroundings. 13. With a r = 0.686, p = 0.001, the data in Table 4.15 offers strong evidence of a significant and favourable link between the comprehensive emotional control scores of university students and their view of the quality of the classroom environment. This link emphasises that students who show more degrees of emotional control tend to see the quality of the classroom environment in a more favourable way. This finding highlights how important emotional control skills are to students' subjective impressions of their classroom. The findings of this study underline the need of teaching emotional control techniques to pupils since they suggest that doing so can help them to view the classroom surroundings. This is validated by the null hypothesis (Ho3.11) being rejected. Finally, our results show the possible benefits of motivating emotional regulation as a tactic to improve students' whole academic performance in higher education situations. 14. The data shown in Table 4.16 shows a strong and substantial positive link (r = 0.798, p = 0.001) between the levels of emotional control of university professors (TER) and student assessments of the quality of the classroom environment (TC). This link suggests that teachers who show better emotional control have a tendency to set classroom environments in which students find general quality more pleasing. This realisation emphasises the critical need of teachers' capacity to control their emotions in determining how their impressions in the classroom affect the educational process of their students. By rejecting the null hypothesis (Ho3.12), these results highlight the need of teaching teachers emotional management techniques. These projects could improve classroom dynamics, raise student participation, and create more favourable learning environments. Finally, our results show that funding instructors' capacity to control their emotions could have major benefits for improving student performance in higher education and thereby supporting good teaching. In summary The careful regression studies on emotional control and their impact on many facets of university students' experiences yield several interesting results. Above all, the studies show that students mostly rely on cognitive reappraisal and emotional suppression strategies to control their feelings. While emotional suppression indicates a tendency to limit both good and negative emotional responses, cognitive reappraisal is the modification of one's beliefs to increase pleasant emotions or reinterpret difficult events. These strategies underline the need of cognitive processes in controlling emotions among students by stressing their capacity to dynamically control social and academic pressures. The results also show complex relationships between emotional control and significant results like emotional awareness, interpersonal abilities, and social adaptation. Though these traits and emotional regulation have statistically significant links, their practical influence is different. While emotional awareness and self-awareness benefit from emotional control, its influence on interpersonal skills and overall social adjustment is somewhat small. This suggests that, although emotional control is a factor, there are other unidentified factors with great influence on students' interpersonal skills and social integration in university environments. Moreover, the data show the clear correlation between better degrees of emotional control and students' assessments of the classroom surroundings. Higher emotional control students usually view their learning environment in a positive light. This implies that instruction in emotional control could enhance students' whole educational process. Moreover, the studies underline the important influence of teachers' emotional control on the development of classroom dynamics and student viewpoints. Advanced emotional management techniques help teachers create settings that pupils find to be very favourable for learning. This suggests that teaching efficacy and student involvement could be much enhanced by means of investments in the emotional competencies of teachers. To put it simply, students who want to control their emotions and have good learning experiences must be emotionally literate. Emotional control, however, has a complex and multidimensional effect on many results depending on several interactions. By encouraging students' emotional well-being, hence enhancing classroom dynamics, and finally resulting in increased academic performance and personal development in higher education environments, including emotional regulation training into educational processes can have major benefits. Other studies are recommended to look at other elements and enhance treatments meant to support efficient skills for emotional regulation among university students. Speak. Based on the findings of regression analyses on emotional control among university students, this paper emphasises significant consequences for theory and practice in the field of educational psychology. Consistent with earlier studies stressing the cognitive elements of emotional control, the results show that students mostly use cognitive reappraisal and emotional suppression approaches to regulate their emotions (Gross, 1998; Tamir, 2016). This helps to underline the idea that students actively use cognitive strategies to properly control social and academic demands, therefore influencing their emotional state and general academic performance. Moreover, the results of the research show complex relationships between emotional awareness, interpersonal skills, social adaptation, and emotional regulation among numerous outcomes. Although emotional control and these areas have statistically significant links, the real impact differs, which fits past studies (Brackett et al., 2011; Brackett & Rivers, 2014). For instance, emotional awareness and perceived classroom quality show a clear positive correlation with emotional regulation. Its effect on social adaptation and personal talents is less noteworthy, though. This is in line with current studies implying social functioning and self-awareness depend on emotional regulation. Nonetheless, as Brackett and Salovey (2006) and Salovey and Mayer (1990) suggest, emotional control is only one of several factors influencing these nuanced ideas. Furthermore underlined in the study is the need of teachers' emotional management in determining the classroom environment and student impressions of it. This finding is consistent with research underlining the major influence of teachers' emotional capacity in creating positive learning environments and student participation (Brackett et al., 2010; Jennings & Greenberg, 2009). The study underlines how important it is for teachers to have emotional abilities in order to create encouraging learning surroundings that advance the emotional well-being of their students. Higher instructor emotional control and better classroom quality—perceived by the students—show evidence of this. In the end, this paper offers insightful analysis on the need of emotional control in higher education. Moreover, it builds on and conforms to earlier studies results. This study emphasises in educational environments the complicated influence of emotional regulation on several student outcomes, including emotional awareness and classroom impressions. With the intention of building more loving and effective learning environments, future studies should investigate other elements and techniques to help teachers and students to enhance their capacity for emotional regulation. ideas Based on the results of the research on emotional regulation among university students, the following recommendations arise: Include structured emotional control courses into courses to give students practical tools for managing social-emotional and academic challenges. • Provide teachers with professional development opportunities to enhance their capacity for emotional control, therefore fostering a classroom that supports student participation and welfare. • Encourage among students the application of cognitive reappraisal strategies by means of seminars or counselling sessions, stressing the benefits of changing perspectives to increase emotional resilience. • Provide a broad spectrum of support services that meet kids' academic and emotional needs thereby fostering their whole growth and success. • Considering different student demographics and educational settings, allocate resources to perform additional study aiming at investigating effective intervention strategies that increase emotional control in university students.

Quantitative messages

A time series illustrated with a line chart demonstrating trends in U.S. federal spending and revenue over time.
A scatterplot illustrating the correlation between two variables (inflation and unemployment) measured at points in time.

Stephen Few described eight types of quantitative messages that users may attempt to understand or communicate from a set of data and the associated graphs used to help communicate the message.[48] Customers specifying requirements and analysts performing the data analysis may consider these messages during the course of the process.[49]

  1. Time-series: A single variable is captured over a period of time, such as the unemployment rate over a 10-year period. A line chart may be used to demonstrate the trend.[50]
  2. Ranking: Categorical subdivisions are ranked in ascending or descending order, such as a ranking of sales performance (the measure) by salespersons (the category, with each salesperson a categorical subdivision) during a single period.[51] A bar chart may be used to show the comparison across the salespersons.[52]
  3. Part-to-whole: Categorical subdivisions are measured as a ratio to the whole (i.e., a percentage out of 100%). A pie chart or bar chart can show the comparison of ratios, such as the market share represented by competitors in a market.[53]
  4. Deviation: Categorical subdivisions are compared against a reference, such as a comparison of actual vs. budget expenses for several departments of a business for a given time period. A bar chart can show the comparison of the actual versus the reference amount.[54]
  5. Frequency distribution: Shows the number of observations of a particular variable for a given interval, such as the number of years in which the stock market return is between intervals such as 0–10%, 11–20%, etc. A histogram, a type of bar chart, may be used for this analysis.[55]
  6. Correlation: Comparison between observations represented by two variables (X,Y) to determine if they tend to move in the same or opposite directions. For example, plotting unemployment (X) and inflation (Y) for a sample of months. A scatter plot is typically used for this message.[56]
  7. Nominal comparison: Comparing categorical subdivisions in no particular order, such as the sales volume by product code. A bar chart may be used for this comparison.[57]
  8. Geographic or geospatial: Comparison of a variable across a map or layout, such as the unemployment rate by state or the number of persons on the various floors of a building. A cartogram is a typical graphic used.[58][59]

Analyzing quantitative data

Author Jonathan Koomey has recommended a series of best practices for understanding quantitative data.[60] These include:

  • Check raw data for anomalies prior to performing an analysis;
  • Re-perform important calculations, such as verifying columns of data that are formula driven;
  • Confirm main totals are the sum of subtotals;
  • Check relationships between numbers that should be related in a predictable way, such as ratios over time;
  • Normalize numbers to make comparisons easier, such as analyzing amounts per person or relative to GDP or as an index value relative to a base year;
  • Break problems into component parts by analyzing factors that led to the results, such as DuPont analysis of return on equity.[25]

For the variables under examination, analysts typically obtain descriptive statistics for them, such as the mean (average), median, and standard deviation.[61] They may also analyze the distribution of the key variables to see how the individual values cluster around the mean.[62]

An illustration of the MECE principle used for data analysis.

The consultants at McKinsey and Company named a technique for breaking a quantitative problem down into its component parts called the MECE principle.[63] Each layer can be broken down into its components; each of the sub-components must be mutually exclusive of each other and collectively add up to the layer above them.[64] The relationship is referred to as "Mutually Exclusive and Collectively Exhaustive" or MECE. For example, profit by definition can be broken down into total revenue and total cost.[65] In turn, total revenue can be analyzed by its components, such as the revenue of divisions A, B, and C (which are mutually exclusive of each other) and should add to the total revenue (collectively exhaustive).[66]

Analysts may use robust statistical measurements to solve certain analytical problems.[67] Hypothesis testing is used when a particular hypothesis about the true state of affairs is made by the analyst and data is gathered to determine whether that state of affairs is true or false.[68][69] For example, the hypothesis might be that "Unemployment has no effect on inflation", which relates to an economics concept called the Phillips Curve.[70] Hypothesis testing involves considering the likelihood of Type I and type II errors, which relate to whether the data supports accepting or rejecting the hypothesis.[71][72]

Regression analysis may be used when the analyst is trying to determine the extent to which independent variable X affects dependent variable Y (e.g., "To what extent do changes in the unemployment rate (X) affect the inflation rate (Y)?").[73] This is an attempt to model or fit an equation line or curve to the data, such that Y is a function of X.[74][75]

Necessary condition analysis (NCA) may be used when the analyst is trying to determine the extent to which independent variable X allows variable Y (e.g., "To what extent is a certain unemployment rate (X) necessary for a certain inflation rate (Y)?").[73] Whereas (multiple) regression analysis uses additive logic where each X-variable can produce the outcome and the X's can compensate for each other (they are sufficient but not necessary),[76] necessary condition analysis (NCA) uses necessity logic, where one or more X-variables allow the outcome to exist, but may not produce it (they are necessary but not sufficient). Each single necessary condition must be present and compensation is not possible.[77]

Analytical activities of data users

Analytic activities of data visualization users

Users may have particular data points of interest within a data set, as opposed to the general messaging outlined above. Such low-level user analytic activities are presented in the following table. The taxonomy can also be organized by three poles of activities: retrieving values, finding data points, and arranging data points.[78][79][80][81]

# Task General
Description
Pro Forma
Abstract
Examples
1 Retrieve Value Given a set of specific cases, find attributes of those cases. What are the values of attributes {X, Y, Z, ...} in the data cases {A, B, C, ...}? - What is the mileage per gallon of the Ford Mondeo?

- How long is the movie Gone with the Wind?

2 Filter Given some concrete conditions on attribute values, find data cases satisfying those conditions. Which data cases satisfy conditions {A, B, C...}? - What Kellogg's cereals have high fiber?

- What comedies have won awards?

- Which funds underperformed the SP-500?

3 Compute Derived Value Given a set of data cases, compute an aggregate numeric representation of those data cases. What is the value of aggregation function F over a given set S of data cases? - What is the average calorie content of Post cereals?

- What is the gross income of all stores combined?

- How many manufacturers of cars are there?

4 Find Extremum Find data cases possessing an extreme value of an attribute over its range within the data set. What are the top/bottom N data cases with respect to attribute A? - What is the car with the highest MPG?

- What director/film has won the most awards?

- What Marvel Studios film has the most recent release date?

5 Sort Given a set of data cases, rank them according to some ordinal metric. What is the sorted order of a set S of data cases according to their value of attribute A? - Order the cars by weight.

- Rank the cereals by calories.

6 Determine Range Given a set of data cases and an attribute of interest, find the span of values within the set. What is the range of values of attribute A in a set S of data cases? - What is the range of film lengths?

- What is the range of car horsepowers?

- What actresses are in the data set?

7 Characterize Distribution Given a set of data cases and a quantitative attribute of interest, characterize the distribution of that attribute's values over the set. What is the distribution of values of attribute A in a set S of data cases? - What is the distribution of carbohydrates in cereals?

- What is the age distribution of shoppers?

8 Find Anomalies Identify any anomalies within a given set of data cases with respect to a given relationship or expectation, e.g. statistical outliers. Which data cases in a set S of data cases have unexpected/exceptional values? - Are there exceptions to the relationship between horsepower and acceleration?

- Are there any outliers in protein?

9 Cluster Given a set of data cases, find clusters of similar attribute values. Which data cases in a set S of data cases are similar in value for attributes {X, Y, Z, ...}? - Are there groups of cereals w/ similar fat/calories/sugar?

- Is there a cluster of typical film lengths?

10 Correlate Given a set of data cases and two attributes, determine useful relationships between the values of those attributes. What is the correlation between attributes X and Y over a given set S of data cases? - Is there a correlation between carbohydrates and fat?

- Is there a correlation between country of origin and MPG?

- Do different genders have a preferred payment method?

- Is there a trend of increasing film length over the years?

11 Contextualization[81] Given a set of data cases, find contextual relevancy of the data to the users. Which data cases in a set S of data cases are relevant to the current users' context? - Are there groups of restaurants that have foods based on my current caloric intake?

Barriers to effective analysis

Barriers to effective analysis may exist among the analysts performing the data analysis or among the audience. Distinguishing fact from opinion, cognitive biases, and innumeracy are all challenges to sound data analysis.[82]

Confusing fact and opinion

You are entitled to your own opinion, but you are not entitled to your own facts.

Effective analysis requires obtaining relevant facts to answer questions, support a conclusion or formal opinion, or test hypotheses.[83][84] Facts by definition are irrefutable, meaning that any person involved in the analysis should be able to agree upon them.[85] For example, in August 2010, the Congressional Budget Office (CBO) estimated that extending the Bush tax cuts of 2001 and 2003 for the 2011–2020 time period would add approximately $3.3 trillion to the national debt.[86] Everyone should be able to agree that indeed this is what CBO reported; they can all examine the report. This makes it a fact. Whether persons agree or disagree with the CBO is their own opinion.[87]

As another example, the auditor of a public company must arrive at a formal opinion on whether financial statements of publicly traded corporations are "fairly stated, in all material respects".[88] This requires extensive analysis of factual data and evidence to support their opinion. When making the leap from facts to opinions, there is always the possibility that the opinion is erroneous.[89]

Cognitive biases

There are a variety of cognitive biases that can adversely affect analysis. For example, confirmation bias is the tendency to search for or interpret information in a way that confirms one's preconceptions.[90] In addition, individuals may discredit information that does not support their views.[91]

Analysts may be trained specifically to be aware of these biases and how to overcome them.[92] In his book Psychology of Intelligence Analysis, retired CIA analyst Richards Heuer wrote that analysts should clearly delineate their assumptions and chains of inference and specify the degree and source of the uncertainty involved in the conclusions.[93] He emphasized procedures to help surface and debate alternative points of view.[94]

Innumeracy

Effective analysts are generally adept with a variety of numerical techniques. However, audiences may not have such literacy with numbers or numeracy; they are said to be innumerate.[95] Persons communicating the data may also be attempting to mislead or misinform, deliberately using bad numerical techniques.[96]

For example, whether a number is rising or falling may not be the key factor. More important may be the number relative to another number, such as the size of government revenue or spending relative to the size of the economy (GDP) or the amount of cost relative to revenue in corporate financial statements.[97] This numerical technique is referred to as normalization[25] or common-sizing. There are many such techniques employed by analysts, whether adjusting for inflation (i.e., comparing real vs. nominal data) or considering population increases, demographics, etc.[98] Analysts apply a variety of techniques to address the various quantitative messages described in the section above.[99]

Analysts may also analyze data under different assumptions or scenario. For example, when analysts perform financial statement analysis, they will often recast the financial statements under different assumptions to help arrive at an estimate of future cash flow, which they then discount to present value based on some interest rate, to determine the valuation of the company or its stock.[100][101] Similarly, the CBO analyzes the effects of various policy options on the government's revenue, outlays and deficits, creating alternative future scenarios for key measures.[102]

Other topics

Smart buildings

A data analytics approach can be used in order to predict energy consumption in buildings.[103] The different steps of the data analysis process are carried out in order to realise smart buildings, where the building management and control operations including heating, ventilation, air conditioning, lighting and security are realised automatically by miming the needs of the building users and optimising resources like energy and time.[104]

Analytics and business intelligence

Analytics is the "extensive use of data, statistical and quantitative analysis, explanatory and predictive models, and fact-based management to drive decisions and actions." It is a subset of business intelligence, which is a set of technologies and processes that uses data to understand and analyze business performance to drive decision-making .[105]

Education

In education, most educators have access to a data system for the purpose of analyzing student data.[106] These data systems present data to educators in an over-the-counter data format (embedding labels, supplemental documentation, and a help system and making key package/display and content decisions) to improve the accuracy of educators’ data analyses.[107]

Practitioner notes

This section contains rather technical explanations that may assist practitioners but are beyond the typical scope of a Wikipedia article.[108]

Initial data analysis

The most important distinction between the initial data analysis phase and the main analysis phase, is that during initial data analysis one refrains from any analysis that is aimed at answering the original research question.[109] The initial data analysis phase is guided by the following four questions:[110]

Quality of data

The quality of the data should be checked as early as possible. Data quality can be assessed in several ways, using different types of analysis: frequency counts, descriptive statistics (mean, standard deviation, median), normality (skewness, kurtosis, frequency histograms), normal imputation is needed.[111]

  • Analysis of extreme observations: outlying observations in the data are analyzed to see if they seem to disturb the distribution.[112]
  • Comparison and correction of differences in coding schemes: variables are compared with coding schemes of variables external to the data set, and possibly corrected if coding schemes are not comparable.[113]
  • Test for common-method variance.

The choice of analyses to assess the data quality during the initial data analysis phase depends on the analyses that will be conducted in the main analysis phase.[114]

Quality of measurements

The quality of the measurement instruments should only be checked during the initial data analysis phase when this is not the focus or research question of the study.[115][116] One should check whether structure of measurement instruments corresponds to structure reported in the literature.

There are two ways to assess measurement quality:

  • Confirmatory factor analysis
  • Analysis of homogeneity (internal consistency), which gives an indication of the reliability of a measurement instrument.[117] During this analysis, one inspects the variances of the items and the scales, the Cronbach's α of the scales, and the change in the Cronbach's alpha when an item would be deleted from a scale[118]

Initial transformations

After assessing the quality of the data and of the measurements, one might decide to impute missing data, or to perform initial transformations of one or more variables, although this can also be done during the main analysis phase.[119]
Possible transformations of variables are:[120]

  • Square root transformation (if the distribution differs moderately from normal)
  • Log-transformation (if the distribution differs substantially from normal)
  • Inverse transformation (if the distribution differs severely from normal)
  • Make categorical (ordinal / dichotomous) (if the distribution differs severely from normal, and no transformations help)

Did the implementation of the study fulfill the intentions of the research design?

One should check the success of the randomization procedure, for instance by checking whether background and substantive variables are equally distributed within and across groups.[121]
If the study did not need or use a randomization procedure, one should check the success of the non-random sampling, for instance by checking whether all subgroups of the population of interest are represented in sample.[122]
Other possible data distortions that should be checked are:

  • dropout (this should be identified during the initial data analysis phase)
  • Item non-response (whether this is random or not should be assessed during the initial data analysis phase)
  • Treatment quality (using manipulation checks).[123]

Characteristics of data sample

In any report or article, the structure of the sample must be accurately described.[124][125] It is especially important to exactly determine the structure of the sample (and specifically the size of the subgroups) when subgroup analyses will be performed during the main analysis phase.[126]
The characteristics of the data sample can be assessed by looking at:

  • Basic statistics of important variables
  • Scatter plots
  • Correlations and associations
  • Cross-tabulations[127]

Final stage of the initial data analysis

During the final stage, the findings of the initial data analysis are documented, and necessary, preferable, and possible corrective actions are taken.[128]
Also, the original plan for the main data analyses can and should be specified in more detail or rewritten.[129] In order to do this, several decisions about the main data analyses can and should be made:

  • In the case of non-normals: should one transform variables; make variables categorical (ordinal/dichotomous); adapt the analysis method?
  • In the case of missing data: should one neglect or impute the missing data; which imputation technique should be used?
  • In the case of outliers: should one use robust analysis techniques?
  • In case items do not fit the scale: should one adapt the measurement instrument by omitting items, or rather ensure comparability with other (uses of the) measurement instrument(s)?
  • In the case of (too) small subgroups: should one drop the hypothesis about inter-group differences, or use small sample techniques, like exact tests or bootstrapping?
  • In case the randomization procedure seems to be defective: can and should one calculate propensity scores and include them as covariates in the main analyses?[130]

Analysis

Several analyses can be used during the initial data analysis phase:[131]

  • Univariate statistics (single variable)
  • Bivariate associations (correlations)
  • Graphical techniques (scatter plots)

It is important to take the measurement levels of the variables into account for the analyses, as special statistical techniques are available for each level:[132]

  • Nominal and ordinal variables
    • Frequency counts (numbers and percentages)
    • Associations
      • circumambulations (crosstabulations)
      • hierarchical loglinear analysis (restricted to a maximum of 8 variables)
      • loglinear analysis (to identify relevant/important variables and possible confounders)
    • Exact tests or bootstrapping (in case subgroups are small)
    • Computation of new variables
  • Continuous variables
    • Distribution
      • Statistics (M, SD, variance, skewness, kurtosis)
      • Stem-and-leaf displays
      • Box plots

Nonlinear analysis

Nonlinear analysis is often necessary when the data is recorded from a nonlinear system. Nonlinear systems can exhibit complex dynamic effects including bifurcations, chaos, harmonics and subharmonics that cannot be analyzed using simple linear methods. Nonlinear data analysis is closely related to nonlinear system identification.[133]

Main data analysis

In the main analysis phase, analyses aimed at answering the research question are performed as well as any other relevant analysis needed to write the first draft of the research report.[134]

Exploratory and confirmatory approaches

In the main analysis phase, either an exploratory or confirmatory approach can be adopted. Usually the approach is decided before data is collected.[135] In an exploratory analysis no clear hypothesis is stated before analysing the data, and the data is searched for models that describe the data well.[136] In a confirmatory analysis clear hypotheses about the data are tested.[137]

Exploratory data analysis should be interpreted carefully. When testing multiple models at once there is a high chance on finding at least one of them to be significant, but this can be due to a type 1 error.[138] It is important to always adjust the significance level when testing multiple models with, for example, a Bonferroni correction.[139] Also, one should not follow up an exploratory analysis with a confirmatory analysis in the same dataset.[140] An exploratory analysis is used to find ideas for a theory, but not to test that theory as well.[140] When a model is found exploratory in a dataset, then following up that analysis with a confirmatory analysis in the same dataset could simply mean that the results of the confirmatory analysis are due to the same type 1 error that resulted in the exploratory model in the first place.[140] The confirmatory analysis therefore will not be more informative than the original exploratory analysis.[141]

Stability of results

It is important to obtain some indication about how generalizable the results are.[142] While this is often difficult to check, one can look at the stability of the results. Are the results reliable and reproducible? There are two main ways of doing that.[143]

  • Cross-validation. By splitting the data into multiple parts, we can check if an analysis (like a fitted model) based on one part of the data generalizes to another part of the data as well.[144] Cross-validation is generally inappropriate, though, if there are correlations within the data, e.g. with panel data.[145] Hence other methods of validation sometimes need to be used. For more on this topic, see statistical model validation.[146]
  • Sensitivity analysis. A procedure to study the behavior of a system or model when global parameters are (systematically) varied. One way to do that is via bootstrapping.[147]

Free software for data analysis

Notable free software for data analysis include:

  • DevInfo – A database system endorsed by the United Nations Development Group for monitoring and analyzing human development.[148]
  • ELKI – Data mining framework in Java with data mining oriented visualization functions.
  • KNIME – The Konstanz Information Miner, a user friendly and comprehensive data analytics framework.
  • Orange – A visual programming tool featuring interactive data visualization and methods for statistical data analysis, data mining, and machine learning.
  • Pandas – Python library for data analysis.
  • PAW – FORTRAN/C data analysis framework developed at CERN.
  • R – A programming language and software environment for statistical computing and graphics.[149]
  • ROOT – C++ data analysis framework developed at CERN.
  • SciPy – Python library for scientific computing.
  • Julia – A programming language well-suited for numerical analysis and computational science.

Reproducible analysis

The typical data analysis workflow involves collecting data, running analyses through various scripts, creating visualizations, and writing reports. However, this workflow presents challenges, including a separation between analysis scripts and data, as well as a gap between analysis and documentation. Often, the correct order of running scripts is only described informally or resides in the data scientist's memory. The potential for losing this information creates issues for reproducibility. To address these challenges, it is essential to have analysis scripts written for automated, reproducible workflows. Additionally, dynamic documentation is crucial, providing reports that are understandable by both machines and humans, ensuring accurate representation of the analysis workflow even as scripts evolve.[150]

International data analysis contests

Different companies or organizations hold data analysis contests to encourage researchers to utilize their data or to solve a particular question using data analysis.[151][152] A few examples of well-known international data analysis contests are as follows:[153]

See also

References

Citations

  1. ^ a b "Transforming Unstructured Data into Useful Information", Big Data, Mining, and Analytics, Auerbach Publications, pp. 227–246, 2014-03-12, doi:10.1201/b16666-14, ISBN 978-0-429-09529-0, retrieved 2021-05-29
  2. ^ "The Multiple Facets of Correlation Functions", Data Analysis Techniques for Physical Scientists, Cambridge University Press, pp. 526–576, 2017, doi:10.1017/9781108241922.013, ISBN 978-1-108-41678-8, retrieved 2021-05-29
  3. ^ Xia, B. S., & Gong, P. (2015). Review of business intelligence through data analysis. Benchmarking, 21(2), 300-311. doi:10.1108/BIJ-08-2012-0050
  4. ^ Exploring Data Analysis
  5. ^ "Data Coding and Exploratory Analysis (EDA) Rules for Data Coding Exploratory Data Analysis (EDA) Statistical Assumptions", SPSS for Intermediate Statistics, Routledge, pp. 42–67, 2004-08-16, doi:10.4324/9781410611420-6, ISBN 978-1-4106-1142-0, retrieved 2021-05-29
  6. ^ Spie (2014-10-01). "New European ICT call focuses on PICs, lasers, data transfer". SPIE Professional. doi:10.1117/2.4201410.10. ISSN 1994-4403.
  7. ^ Samandar, Petersson; Svantesson, Sofia (2017). Skapandet av förtroende inom eWOM : En studie av profilbildens effekt ur ett könsperspektiv. Högskolan i Gävle, Företagsekonomi. OCLC 1233454128.
  8. ^ Goodnight, James (2011-01-13). "The forecast for predictive analytics: hot and getting hotter". Statistical Analysis and Data Mining: The ASA Data Science Journal. 4 (1): 9–10. doi:10.1002/sam.10106. ISSN 1932-1864. S2CID 38571193.
  9. ^ Sherman, Rick (4 November 2014). Business intelligence guidebook: from data integration to analytics. Amsterdam. ISBN 978-0-12-411528-6. OCLC 894555128.{{cite book}}: CS1 maint: location missing publisher (link)
  10. ^ Field, John (2009), "Dividing listening into its components", Listening in the Language Classroom, Cambridge: Cambridge University Press, pp. 96–109, doi:10.1017/cbo9780511575945.008, ISBN 978-0-511-57594-5, retrieved 2021-05-29
  11. ^ a b c Judd, Charles; McCleland, Gary (1989). Data Analysis. Harcourt Brace Jovanovich. ISBN 0-15-516765-0.
  12. ^ Tukey, John W. (March 1962). "John Tukey-The Future of Data Analysis-July 1961". The Annals of Mathematical Statistics. 33 (1): 1–67. doi:10.1214/aoms/1177704711. Archived from the original on 2020-01-26. Retrieved 2015-01-01.
  13. ^ a b c d e f g Schutt, Rachel; O'Neil, Cathy (2013). Doing Data Science. O'Reilly Media. ISBN 978-1-449-35865-5.
  14. ^ "USE OF THE DATA", Handbook of Petroleum Product Analysis, Hoboken, NJ: John Wiley & Sons, Inc, pp. 296–303, 2015-02-06, doi:10.1002/9781118986370.ch18, ISBN 978-1-118-98637-0, retrieved 2021-05-29
  15. ^ Ainsworth, Penne (20 May 2019). Introduction to accounting : an integrated approach. John Wiley & Sons. ISBN 978-1-119-60014-5. OCLC 1097366032.
  16. ^ Margo, Robert A. (2000). Wages and labor markets in the United States, 1820-1860. University of Chicago Press. ISBN 0-226-50507-3. OCLC 41285104.
  17. ^ Olusola, Johnson Adedeji; Shote, Adebola Adekunle; Ouigmane, Abdellah; Isaifan, Rima J. (7 May 2021). "Table 1: Data type and sources of data collected for this research". PeerJ. 9: e11387. doi:10.7717/peerj.11387/table-1.
  18. ^ MacPherson, Derek (2019-10-16), "Information Technology Analysts' Perspectives", Data Strategy in Colleges and Universities, Routledge, pp. 168–183, doi:10.4324/9780429437564-12, ISBN 978-0-429-43756-4, S2CID 211738958, retrieved 2021-05-29
  19. ^ Nelson, Stephen L. (2014). Excel data analysis for dummies. Wiley. ISBN 978-1-118-89810-9. OCLC 877772392.
  20. ^ "Figure 3—source data 1. Raw and processed values obtained through qPCR". 30 August 2017. doi:10.7554/elife.28468.029. {{cite journal}}: Cite journal requires |journal= (help)
  21. ^ a b Bohannon, John (2016-02-24). "Many surveys, about one in five, may contain fraudulent data". Science. doi:10.1126/science.aaf4104. ISSN 0036-8075.
  22. ^ Jeannie Scruggs, Garber; Gross, Monty; Slonim, Anthony D. (2010). Avoiding common nursing errors. Wolters Kluwer Health/Lippincott Williams & Wilkins. ISBN 978-1-60547-087-0. OCLC 338288678.
  23. ^ "Data Cleaning". Microsoft Research. Archived from the original on 29 October 2013. Retrieved 26 October 2013.
  24. ^ Hancock, R.G.V.; Carter, Tristan (February 2010). "How reliable are our published archaeometric analyses? Effects of analytical techniques through time on the elemental analysis of obsidians". Journal of Archaeological Science. 37 (2): 243–250. Bibcode:2010JArSc..37..243H. doi:10.1016/j.jas.2009.10.004. ISSN 0305-4403.
  25. ^ a b c "Perceptual Edge-Jonathan Koomey-Best practices for understanding quantitative data-February 14, 2006" (PDF). Archived (PDF) from the original on October 5, 2014. Retrieved November 12, 2014.
  26. ^ Peleg, Roni; Avdalimov, Angelika; Freud, Tamar (2011-03-23). "Providing cell phone numbers and email addresses to Patients: the physician's perspective". BMC Research Notes. 4 (1): 76. doi:10.1186/1756-0500-4-76. ISSN 1756-0500. PMC 3076270. PMID 21426591.
  27. ^ Goodman, Lenn Evan (1998). Judaism, human rights, and human values. Oxford University Press. ISBN 0-585-24568-1. OCLC 45733915.
  28. ^ Hanzo, Lajos. "Blind joint maximum likelihood channel estimation and data detection for single-input multiple-output systems". doi:10.1049/iet-tv.44.786. Retrieved 2021-05-29. {{cite journal}}: Cite journal requires |journal= (help)
  29. ^ Hellerstein, Joseph (27 February 2008). "Quantitative Data Cleaning for Large Databases" (PDF). EECS Computer Science Division: 3. Archived (PDF) from the original on 13 October 2013. Retrieved 26 October 2013.
  30. ^ Davis, Steve; Pettengill, James B.; Luo, Yan; Payne, Justin; Shpuntoff, Al; Rand, Hugh; Strain, Errol (26 August 2015). "CFSAN SNP Pipeline: An automated method for constructing SNP matrices from next-generation sequence data". PeerJ Computer Science. 1: e20. doi:10.7717/peerj-cs.20/supp-1.
  31. ^ "FTC requests additional data". Pump Industry Analyst. 1999 (48): 12. December 1999. doi:10.1016/s1359-6128(99)90509-8. ISSN 1359-6128.
  32. ^ "Exploring your Data with Data Visualization & Descriptive Statistics: Common Descriptive Statistics for Quantitative Data". 2017. doi:10.4135/9781529732795. {{cite journal}}: Cite journal requires |journal= (help)
  33. ^ Murray, Daniel G. (2013). Tableau your data! : fast and easy visual analysis with Tableau Software. J. Wiley & Sons. ISBN 978-1-118-61204-0. OCLC 873810654.
  34. ^ Ben-Ari, Mordechai (2012), "First-Order Logic: Formulas, Models, Tableaux", Mathematical Logic for Computer Science, London: Springer London, pp. 131–154, doi:10.1007/978-1-4471-4129-7_7, ISBN 978-1-4471-4128-0, retrieved 2021-05-31
  35. ^ Sosa, Ernest (2011). Causation. Oxford Univ. Press. ISBN 978-0-19-875094-9. OCLC 767569031.
  36. ^ Evans, Michelle V.; Dallas, Tad A.; Han, Barbara A.; Murdock, Courtney C.; Drake, John M. (28 February 2017). Brady, Oliver (ed.). "Figure 2. Variable importance by permutation, averaged over 25 models". eLife. 6: e22053. doi:10.7554/elife.22053.004.
  37. ^ Watson, Kevin; Halperin, Israel; Aguilera-Castells, Joan; Iacono, Antonio Dello (12 November 2020). "Table 3: Descriptive (mean ± SD), inferential (95% CI) and qualitative statistics (ES) of all variables between self-selected and predetermined conditions". PeerJ. 8: e10361. doi:10.7717/peerj.10361/table-3.
  38. ^ Cortés-Molino, Álvaro; Aulló-Maestro, Isabel; Fernandez-Luque, Ismael; Flores-Moya, Antonio; Carreira, José A.; Salvo, A. Enrique (22 October 2020). "Table 3: Best regression models between LIDAR data (independent variable) and field-based Forestereo data (dependent variable), used to map spatial distribution of the main forest structure variables". PeerJ. 8: e10158. doi:10.7717/peerj.10158/table-3.
  39. ^ International Sales Terms, Beck/Hart, 2014, doi:10.5040/9781472561671.ch-003, ISBN 978-1-4725-6167-1, retrieved 2021-05-31
  40. ^ Nwabueze, JC (2008-05-21). "Performances of estimators of linear model with auto-correlated error terms when the independent variable is normal". Journal of the Nigerian Association of Mathematical Physics. 9 (1). doi:10.4314/jonamp.v9i1.40071. ISSN 1116-4336.
  41. ^ Conway, Steve (2012-07-04). "A Cautionary Note on Data Inputs and Visual Outputs in Social Network Analysis". British Journal of Management. 25 (1): 102–117. doi:10.1111/j.1467-8551.2012.00835.x. hdl:2381/36068. ISSN 1045-3172. S2CID 154347514.
  42. ^ "Customer Purchases and Other Repeated Events", Data Analysis Using SQL and Excel®, Indianapolis, Indiana: John Wiley & Sons, Inc., pp. 367–420, 2016-01-29, doi:10.1002/9781119183419.ch8, ISBN 978-1-119-18341-9, retrieved 2021-05-31
  43. ^ Grandjean, Martin (2014). "La connaissance est un réseau" (PDF). Les Cahiers du Numérique. 10 (3): 37–54. doi:10.3166/lcn.10.3.37-54. Archived (PDF) from the original on 2015-09-27. Retrieved 2015-05-05.
  44. ^ Data requirements for semiconductor die. Exchange data formats and data dictionary, BSI British Standards, doi:10.3403/02271298, retrieved 2021-05-31
  45. ^ Yee, D. (1985-04-01). "How to Communicate Your Message to an Audience Effectively". The Gerontologist. 25 (2): 209. doi:10.1093/geront/25.2.209. ISSN 0016-9013.
  46. ^ Bemowska-Kałabun, Olga; Wąsowicz, Paweł; Napora-Rutkowski, Łukasz; Nowak-Życzyńska, Zuzanna; Wierzbicka, Małgorzata (11 June 2019). "Supplemental Information 1: Raw data for charts and tables". doi:10.7287/peerj.preprints.27793v1/supp-1. {{cite journal}}: Cite journal requires |journal= (help)
  47. ^ Visualizing Data About UK Museums: Bar Charts, Line Charts and Heat Maps. 2021. doi:10.4135/9781529768749. ISBN 9781529768749. S2CID 240967380.
  48. ^ Tunqui Neira, José Manuel (2019-09-19). "Thank you for your review. Please find in the attached pdf file a detailed response to the points you raised". doi:10.5194/hess-2019-325-ac2. S2CID 241041810. {{cite journal}}: Cite journal requires |journal= (help)
  49. ^ Brackett, John W. (1989), "Performing Requirements Analysis Project Courses for External Customers", Issues in Software Engineering Education, New York, NY: Springer New York, pp. 276–285, doi:10.1007/978-1-4613-9614-7_20, ISBN 978-1-4613-9616-1, retrieved 2021-06-03
  50. ^ Wyckhuys, Kris A. G.; Wongtiem, Prapit; Rauf, Aunu; Thancharoen, Anchana; Heimpel, George E.; Le, Nhung T. T.; Fanani, Muhammad Zainal; Gurr, Geoff M.; Lundgren, Jonathan G.; Burra, Dharani D.; Palao, Leo K.; Hyman, Glenn; Graziosi, Ignazio; Le, Vi X.; Cock, Matthew J. W.; Tscharntke, Teja; Wratten, Steve D.; Nguyen, Liem V.; You, Minsheng; Lu, Yanhui; Ketelaar, Johannes W.; Goergen, Georg; Neuenschwander, Peter (19 October 2018). "Figure 2: Bi-monthly mealybug population fluctuations in southern Vietnam, over a 2-year time period". PeerJ. 6: e5796. doi:10.7717/peerj.5796/fig-2.
  51. ^ Riehl, Emily (2014), "A sampling of 2-categorical aspects of quasi-category theory", Categorical Homotopy Theory, Cambridge: Cambridge University Press, pp. 318–336, doi:10.1017/cbo9781107261457.019, ISBN 978-1-107-26145-7, retrieved 2021-06-03
  52. ^ "X-Bar Chart". Encyclopedia of Production and Manufacturing Management. 2000. p. 841. doi:10.1007/1-4020-0612-8_1063. ISBN 978-0-7923-8630-8.
  53. ^ "Chart C5.3. Percentage of 15-19 year-olds not in education, by labour market status (2012)". doi:10.1787/888933119055. Retrieved 2021-06-03. {{cite journal}}: Cite journal requires |journal= (help)
  54. ^ "Chart 7: Households: final consumption expenditure versus actual individual consumption". doi:10.1787/665527077310. Retrieved 2021-06-03. {{cite journal}}: Cite journal requires |journal= (help)
  55. ^ Chao, Luke H.; Jang, Jaebong; Johnson, Adam; Nguyen, Anthony; Gray, Nathanael S.; Yang, Priscilla L.; Harrison, Stephen C. (12 July 2018). Jahn, Reinhard; Schekman, Randy (eds.). "Figure 4. Frequency of hemifusion (measured as DiD fluorescence dequenching) as a function of number of bound Alexa-fluor-555/3-110-22 molecules". eLife. 7: e36461. doi:10.7554/elife.36461.006.
  56. ^ Garnier, Elodie M.; Fouret, Nastasia; Descoins, Médéric (3 February 2020). "Table 2: Graph comparison between Scatter plot, Violin + Scatter plot, Heatmap and ViSiElse graph". PeerJ. 8: e8341. doi:10.7717/peerj.8341/table-2.
  57. ^ "Product comparison chart: Wearables". PsycEXTRA Dataset. 2009. doi:10.1037/e539162010-006. Retrieved 2021-06-03.
  58. ^ "Stephen Few-Perceptual Edge-Selecting the Right Graph for Your Message-2004" (PDF). Archived (PDF) from the original on 2014-10-05. Retrieved 2014-10-29.
  59. ^ "Stephen Few-Perceptual Edge-Graph Selection Matrix" (PDF). Archived (PDF) from the original on 2014-10-05. Retrieved 2014-10-29.
  60. ^ "Recommended Best Practices". 2008-10-01. doi:10.14217/9781848590151-8-en. Retrieved 2021-06-03. {{cite journal}}: Cite journal requires |journal= (help)
  61. ^ Hobold, Edilson; Pires-Lopes, Vitor; Gómez-Campos, Rossana; Arruda, Miguel de; Andruske, Cynthia Lee; Pacheco-Carrillo, Jaime; Cossio-Bolaños, Marco Antonio (30 November 2017). "Table 1: Descriptive statistics (mean ± standard-deviation) for somatic variables and physical fitness ítems for males and females". PeerJ. 5: e4032. doi:10.7717/peerj.4032/table-1.
  62. ^ Ablin, Jacob N.; Zohar, Ada H.; Zaraya-Blum, Reut; Buskila, Dan (13 September 2016). "Table 2: Cluster analysis presenting mean values of psychological variables per cluster group". PeerJ. 4: e2421. doi:10.7717/peerj.2421/table-2.
  63. ^ "Consultants Employed by McKinsey & Company", Organizational Behavior 5, Routledge, pp. 77–82, 2008-07-30, doi:10.4324/9781315701974-15, ISBN 978-1-315-70197-4, retrieved 2021-06-03
  64. ^ Antiphanes (2007), Olson, S. Douglas (ed.), "H6 Antiphanes fr.172.1-4, from Women Who Looked Like Each Other or Men Who Looked Like Each Other", Broken Laughter: Select Fragments of Greek Comedy, Oxford University Press, doi:10.1093/oseo/instance.00232915, ISBN 978-0-19-928785-7, retrieved 2021-06-03
  65. ^ Carey, Malachy (November 1981). "On Mutually Exclusive and Collectively Exhaustive Properties of Demand Functions". Economica. 48 (192): 407–415. doi:10.2307/2553697. ISSN 0013-0427. JSTOR 2553697.
  66. ^ "Total tax revenue". doi:10.1787/352874835867. Retrieved 2021-06-03. {{cite journal}}: Cite journal requires |journal= (help)
  67. ^ "Dual-use car may solve transportation problems". Chemical & Engineering News Archive. 46 (24): 44. 1968-06-03. doi:10.1021/cen-v046n024.p044. ISSN 0009-2347.
  68. ^ Heckman (1978). "Simple Statistical Models for Discrete Panel Data Developed and Applied to Test the Hypothesis of True State Dependence against the Hypothesis of Spurious State Dependence". Annales de l'inséé (30/31): 227–269. doi:10.2307/20075292. ISSN 0019-0209. JSTOR 20075292.
  69. ^ Koontz, Dean (2017). False Memory. Headline Book Publishing. ISBN 978-1-4722-4830-5. OCLC 966253202.
  70. ^ Munday, Stephen C. R. (1996), "Unemployment, Inflation and the Phillips Curve", Current Developments in Economics, London: Macmillan Education UK, pp. 186–218, doi:10.1007/978-1-349-24986-2_11, ISBN 978-0-333-64444-7, retrieved 2021-06-03
  71. ^ Louangrath, Paul I. (2013). "Alpha and Beta Tests for Type I and Type II Inferential Errors Determination in Hypothesis Testing". SSRN Electronic Journal. doi:10.2139/ssrn.2332756. ISSN 1556-5068.
  72. ^ Walko, Ann M. (2006). Rejecting the second generation hypothesis : maintaining Estonian ethnicity in Lakewood, New Jersey. AMS Press. ISBN 0-404-19454-0. OCLC 467107876.
  73. ^ a b Yanamandra, Venkataramana (September 2015). "Exchange rate changes and inflation in India: What is the extent of exchange rate pass-through to imports?". Economic Analysis and Policy. 47: 57–68. doi:10.1016/j.eap.2015.07.004. ISSN 0313-5926.
  74. ^ Mudiyanselage, Nawarathna; Nawarathna, Pubudu Manoj. Characterization of epigenetic changes and their connection to gene expression abnormalities in clear cell renal cell carcinoma. OCLC 1190697848.
  75. ^ Moreno Delgado, David; Møller, Thor C.; Ster, Jeanne; Giraldo, Jesús; Maurel, Damien; Rovira, Xavier; Scholler, Pauline; Zwier, Jurriaan M.; Perroy, Julie; Durroux, Thierry; Trinquet, Eric; Prezeau, Laurent; Rondard, Philippe; Pin, Jean-Philippe (29 June 2017). Chao, Moses V (ed.). "Appendix 1—figure 5. Curve data included in Appendix 1—table 4 (solid points) and the theoretical curve by using the Hill equation parameters of Appendix 1—table 5 (curve line)". eLife. 6: e25233. doi:10.7554/elife.25233.027.
  76. ^ Feinmann, Jane. "How Can Engineers and Journalists Help Each Other?" (Video). The Institute of Engineering & Technology. doi:10.1049/iet-tv.48.859. Retrieved 2021-06-03.
  77. ^ Dul, Jan (2015). "Necessary Condition Analysis (NCA): Logic and Methodology of 'Necessary But Not Sufficient' Causality". SSRN Electronic Journal. doi:10.2139/ssrn.2588480. hdl:1765/77890. ISSN 1556-5068. S2CID 219380122.
  78. ^ Robert Amar, James Eagan, and John Stasko (2005) "Low-Level Components of Analytic Activity in Information Visualization" Archived 2015-02-13 at the Wayback Machine
  79. ^ William Newman (1994) "A Preliminary Analysis of the Products of HCI Research, Using Pro Forma Abstracts" Archived 2016-03-03 at the Wayback Machine
  80. ^ Mary Shaw (2002) "What Makes Good Research in Software Engineering?" Archived 2018-11-05 at the Wayback Machine
  81. ^ a b Yavari, Ali; Jayaraman, Prem Prakash; Georgakopoulos, Dimitrios; Nepal, Surya (2017). ConTaaS: An Approach to Internet-Scale Contextualisation for Developing Efficient Internet of Things Applications. Proceedings of the 50th Hawaii International Conference on System Sciences (HICSS50 2017). University of Hawaiʻi at Mānoa. doi:10.24251/HICSS.2017.715. hdl:10125/41879. ISBN 9780998133102.
  82. ^ "Connectivity tool transfers data among database and statistical products". Computational Statistics & Data Analysis. 8 (2): 224. July 1989. doi:10.1016/0167-9473(89)90021-2. ISSN 0167-9473.
  83. ^ "Information relevant to your job", Obtaining Information for Effective Management, Routledge, pp. 48–54, 2007-07-11, doi:10.4324/9780080544304-16 (inactive 2024-05-01), ISBN 978-0-08-054430-4, retrieved 2021-06-03{{citation}}: CS1 maint: DOI inactive as of May 2024 (link)
  84. ^ Lehmann, E. L. (2010). Testing statistical hypotheses. Springer. ISBN 978-1-4419-3178-8. OCLC 757477004.
  85. ^ Fielding, Henry (2008-08-14), "Consisting partly of facts, and partly of observations upon them", Tom Jones, Oxford University Press, doi:10.1093/owc/9780199536993.003.0193, ISBN 978-0-19-953699-3, retrieved 2021-06-03
  86. ^ "Congressional Budget Office-The Budget and Economic Outlook-August 2010-Table 1.7 on Page 24" (PDF). 18 August 2010. Archived from the original on 2012-02-27. Retrieved 2011-03-31.
  87. ^ "Students' sense of belonging, by immigrant background". PISA 2015 Results (Volume III). PISA. 2017-04-19. doi:10.1787/9789264273856-table125-en. ISBN 9789264273818. ISSN 1996-3777.
  88. ^ Gordon, Roger (March 1990). "Do Publicly Traded Corporations Act in the Public Interest?". National Bureau of Economic Research Working Papers. Cambridge, MA. doi:10.3386/w3303.
  89. ^ Minardi, Margot (2010-09-24), "Facts and Opinion", Making Slavery History, Oxford University Press, pp. 13–42, doi:10.1093/acprof:oso/9780195379372.003.0003, ISBN 978-0-19-537937-2, retrieved 2021-06-03
  90. ^ Rivard, Jillian R (2014). Confirmation bias in witness interviewing: Can interviewers ignore their preconceptions? (Thesis). Florida International University. doi:10.25148/etd.fi14071109.
  91. ^ Papineau, David (1988), "Does the Sociology of Science Discredit Science?", Relativism and Realism in Science, Dordrecht: Springer Netherlands, pp. 37–57, doi:10.1007/978-94-009-2877-0_2, ISBN 978-94-010-7795-8, retrieved 2021-06-03
  92. ^ Bromme, Rainer; Hesse, Friedrich W.; Spada, Hans, eds. (2005). Barriers and Biases in Computer-Mediated Knowledge Communication. doi:10.1007/b105100. ISBN 978-0-387-24317-7.
  93. ^ Heuer, Richards (2019-06-10). Heuer, Richards J (ed.). Quantitative Approaches to Political Intelligence. doi:10.4324/9780429303647. ISBN 9780429303647. S2CID 145675822.
  94. ^ "Introduction" (PDF). cia.gov. Archived (PDF) from the original on 2021-10-25. Retrieved 2021-10-25.
  95. ^ "Figure 6.7. Differences in literacy scores across OECD countries generally mirror those in numeracy". doi:10.1787/888934081549. Retrieved 2021-06-03. {{cite journal}}: Cite journal requires |journal= (help)
  96. ^ "Bloomberg-Barry Ritholz-Bad Math that Passes for Insight-October 28, 2014". Archived from the original on 2014-10-29. Retrieved 2014-10-29.
  97. ^ Gusnaini, Nuriska; Andesto, Rony; Ermawati (2020-12-15). "The Effect of Regional Government Size, Legislative Size, Number of Population, and Intergovernmental Revenue on The Financial Statements Disclosure". European Journal of Business and Management Research. 5 (6). doi:10.24018/ejbmr.2020.5.6.651. ISSN 2507-1076. S2CID 231675715.
  98. ^ Linsey, Julie S.; Becker, Blake (2011), "Effectiveness of Brainwriting Techniques: Comparing Nominal Groups to Real Teams", Design Creativity 2010, London: Springer London, pp. 165–171, doi:10.1007/978-0-85729-224-7_22, ISBN 978-0-85729-223-0, retrieved 2021-06-03
  99. ^ Lyon, J. (April 2006). "Purported Responsible Address in E-Mail Messages". doi:10.17487/rfc4407. {{cite journal}}: Cite journal requires |journal= (help)
  100. ^ Stock, Eugene (10 June 2017). The History of the Church Missionary Society Its Environment, its Men and its Work. Hansebooks GmbH. ISBN 978-3-337-18120-8. OCLC 1189626777.
  101. ^ Gross, William H. (July 1979). "Coupon Valuation and Interest Rate Cycles". Financial Analysts Journal. 35 (4): 68–71. doi:10.2469/faj.v35.n4.68. ISSN 0015-198X.
  102. ^ "25. General government total outlays". doi:10.1787/888932348795. Retrieved 2021-06-03. {{cite journal}}: Cite journal requires |journal= (help)
  103. ^ González-Vidal, Aurora; Moreno-Cano, Victoria (2016). "Towards energy efficiency smart buildings models based on intelligent data analytics". Procedia Computer Science. 83 (Elsevier): 994–999. doi:10.1016/j.procs.2016.04.213.
  104. ^ "Low-Energy Air Conditioning and Lighting Control", Building Energy Management Systems, Routledge, pp. 406–439, 2013-07-04, doi:10.4324/9780203477342-18 (inactive 2024-05-01), ISBN 978-0-203-47734-2, retrieved 2021-06-03{{citation}}: CS1 maint: DOI inactive as of May 2024 (link)
  105. ^ Davenport, Thomas; Harris, Jeanne (2007). Competing on Analytics. O'Reilly. ISBN 978-1-4221-0332-6.
  106. ^ Aarons, D. (2009). Report finds states on course to build pupil-data systems. Education Week, 29(13), 6.
  107. ^ Rankin, J. (2013, March 28). How data Systems & reports can either fight or propagate the data analysis error epidemic, and how educator leaders can help. Archived 2019-03-26 at the Wayback Machine Presentation conducted from Technology Information Center for Administrative Leadership (TICAL) School Leadership Summit.
  108. ^ Brödermann, Eckart J. (2018), "Article 2.2.1 (Scope of the Section)", Commercial Law, Nomos Verlagsgesellschaft mbH & Co. KG, p. 525, doi:10.5771/9783845276564-525, ISBN 978-3-8452-7656-4, retrieved 2021-06-03
  109. ^ Jaech, J.L. (1960-04-21). "Analysis of dimensional distortion data from initial 24 quality certification tubes". doi:10.2172/10170345. S2CID 110058009. {{cite journal}}: Cite journal requires |journal= (help)
  110. ^ Adèr 2008a, p. 337.
  111. ^ Kjell, Oscar N. E.; Thompson, Sam (19 December 2013). "Descriptive statistics indicating the mean, standard deviation and frequency of missing values for each condition (N = number of participants), and for the dependent variables (DV)". PeerJ. 1: e231. doi:10.7717/peerj.231/table-1.
  112. ^ Practice for Dealing With Outlying Observations, ASTM International, doi:10.1520/e0178-16a, retrieved 2021-06-03
  113. ^ "Alternative Coding Schemes for Dummy Variables", Regression with Dummy Variables, Newbury Park, CA: SAGE Publications, Inc., pp. 64–75, 1993, doi:10.4135/9781412985628.n5, ISBN 978-0-8039-5128-0, retrieved 2021-06-03
  114. ^ Adèr 2008a, pp. 338–341.
  115. ^ Danilyuk, P. M. (July 1960). "Computing the displacement of the initial contour of gears when they are checked by means of balls". Measurement Techniques. 3 (7): 585–587. doi:10.1007/bf00977716. ISSN 0543-1972. S2CID 121058145.
  116. ^ Newman, Isadore (1998). Qualitative-quantitative research methodology : exploring the interactive continuum. Southern Illinois University Press. ISBN 0-585-17889-5. OCLC 44962443.
  117. ^ Terwilliger, James S.; Lele, Kaustubh (June 1979). "Some Relationships Among Internal Consistency, Reproducibility, and Homogeneity". Journal of Educational Measurement. 16 (2): 101–108. doi:10.1111/j.1745-3984.1979.tb00091.x. ISSN 0022-0655.
  118. ^ Adèr 2008a, pp. 341–342.
  119. ^ Adèr 2008a, p. 344.
  120. ^ Tabachnick & Fidell, 2007, p. 87-88.
  121. ^ Tchakarova, Kalina (October 2020). "2020/31 Comparing job descriptions is insufficient for checking whether work is equally valuable (BG)". European Employment Law Cases. 5 (3): 168–170. doi:10.5553/eelc/187791072020005003006. ISSN 1877-9107. S2CID 229008899.
  122. ^ Random sampling and randomization procedures, BSI British Standards, doi:10.3403/30137438, retrieved 2021-06-03
  123. ^ Adèr 2008a, pp. 344–345.
  124. ^ Sandberg, Margareta (June 2006). "Acupuncture Procedures Must be Accurately Described". Acupuncture in Medicine. 24 (2): 92–94. doi:10.1136/aim.24.2.92. ISSN 0964-5284. PMID 16783285. S2CID 30286074.
  125. ^ Jaarsma, C.F. Verkeer in een landelijk gebied: waarnemingen en analyse van het verkeer in zuidwest Friesland en ontwikkeling van een verkeersmodel. OCLC 1016575584.
  126. ^ Foth, Christian; Hedrick, Brandon P.; Ezcurra, Martin D. (18 January 2016). "Figure 4: Centroid size regression analyses for the main sample". PeerJ. 4: e1589. doi:10.7717/peerj.1589/fig-4.
  127. ^ Adèr 2008a, p. 345.
  128. ^ "The Final Years (1975-84)", The Road Not Taken, Boydell & Brewer, pp. 853–922, 2018-06-18, doi:10.2307/j.ctv6cfncp.26, ISBN 978-1-57647-332-0, S2CID 242072487, retrieved 2021-06-03
  129. ^ Fitzmaurice, Kathryn (17 March 2015). Destiny, rewritten. HarperCollins. ISBN 978-0-06-162503-9. OCLC 905090570.
  130. ^ Adèr 2008a, pp. 345–346.
  131. ^ Adèr 2008a, pp. 346–347.
  132. ^ Adèr 2008a, pp. 349–353.
  133. ^ Billings S.A. "Nonlinear System Identification: NARMAX Methods in the Time, Frequency, and Spatio-Temporal Domains". Wiley, 2013
  134. ^ Adèr 2008b, p. 363.
  135. ^ "Exploratory Data Analysis", Python® for R Users, Hoboken, NJ, USA: John Wiley & Sons, Inc., pp. 119–138, 2017-10-13, doi:10.1002/9781119126805.ch4, hdl:11380/971504, ISBN 978-1-119-12680-5, retrieved 2021-06-03
  136. ^ "Engaging in Exploratory Data Analysis, Visualization, and Hypothesis Testing – Exploratory Data Analysis, Geovisualization, and Data", Spatial Analysis, CRC Press, pp. 106–139, 2015-07-28, doi:10.1201/b18808-8, ISBN 978-0-429-06936-9, S2CID 133412598, retrieved 2021-06-03
  137. ^ "Hypotheses About Categories", Starting Statistics: A Short, Clear Guide, London: SAGE Publications Ltd, pp. 138–151, 2010, doi:10.4135/9781446287873.n14, ISBN 978-1-84920-098-1, retrieved 2021-06-03
  138. ^ Sordo, Rachele Del; Sidoni, Angelo (December 2008). "MIB-1 Cell Membrane Reactivity: A Finding That Should be Interpreted Carefully". Applied Immunohistochemistry & Molecular Morphology. 16 (6): 568. doi:10.1097/pai.0b013e31817af2cf. ISSN 1541-2016. PMID 18800001.
  139. ^ Liquet, Benoit; Riou, Jérémie (2013-06-08). "Correction of the significance level when attempting multiple transformations of an explanatory variable in generalized linear models". BMC Medical Research Methodology. 13 (1): 75. doi:10.1186/1471-2288-13-75. ISSN 1471-2288. PMC 3699399. PMID 23758852.
  140. ^ a b c Mcardle, John J. (2008). "Some ethical issues in confirmatory versus exploratory analysis". PsycEXTRA Dataset. doi:10.1037/e503312008-001. Retrieved 2021-06-03.
  141. ^ Adèr 2008b, pp. 361–362.
  142. ^ Adèr 2008b, pp. 361–371.
  143. ^ Truswell IV, William H., ed. (2009), "3 The Facelift: A Guide for Safe, Reliable, and Reproducible Results", Surgical Facial Rejuvenation, Stuttgart: Georg Thieme Verlag, doi:10.1055/b-0034-73436, ISBN 978-1-58890-491-1, retrieved 2021-06-03
  144. ^ Benson, Noah C; Winawer, Jonathan (December 2018). "Bayesian analysis of retinotopic maps". eLife. 7. doi:10.7554/elife.40224. PMC 6340702. PMID 30520736. Supplementary file 1. Cross-validation schema. doi:10.7554/elife.40224.014
  145. ^ Hsiao, Cheng (2014), "Cross-Sectionally Dependent Panel Data", Analysis of Panel Data, Cambridge: Cambridge University Press, pp. 327–368, doi:10.1017/cbo9781139839327.012, ISBN 978-1-139-83932-7, retrieved 2021-06-03
  146. ^ Hjorth, J.S. Urban (2017-10-19), "Cross validation", Computer Intensive Statistical Methods, Chapman and Hall/CRC, pp. 24–56, doi:10.1201/9781315140056-3, ISBN 978-1-315-14005-6, retrieved 2021-06-03
  147. ^ Sheikholeslami, Razi; Razavi, Saman; Haghnegahdar, Amin (2019-10-10). "What should we do when a model crashes? Recommendations for global sensitivity analysis of Earth and environmental systems models". Geoscientific Model Development. 12 (10): 4275–4296. Bibcode:2019GMD....12.4275S. doi:10.5194/gmd-12-4275-2019. ISSN 1991-9603. S2CID 204900339.
  148. ^ United Nations Development Programme (2018). "Human development composite indices". Human Development Indices and Indicators 2018. United Nations. pp. 21–41. doi:10.18356/ce6f8e92-en. S2CID 240207510.
  149. ^ Wiley, Matt; Wiley, Joshua F. (2019), "Multivariate Data Visualization", Advanced R Statistical Programming and Data Models, Berkeley, CA: Apress, pp. 33–59, doi:10.1007/978-1-4842-2872-2_2, ISBN 978-1-4842-2871-5, S2CID 86629516, retrieved 2021-06-03
  150. ^ Mailund, Thomas (2022). Beginning Data Science in R 4: Data Analysis, Visualization, and Modelling for the Data Scientist (2nd ed.). ISBN 978-148428155-0.
  151. ^ Orduna-Malea, Enrique; Alonso-Arroyo, Adolfo (2018), "A cybermetric analysis model to measure private companies", Cybermetric Techniques to Evaluate Organizations Using Web-Based Data, Elsevier, pp. 63–76, doi:10.1016/b978-0-08-101877-4.00003-x, ISBN 978-0-08-101877-4, retrieved 2021-06-03
  152. ^ Leen, A.R. The consumer in Austrian economics and the Austrian perspective on consumer policy. Wageningen Universiteit. ISBN 90-5808-102-8. OCLC 1016689036.
  153. ^ "Examples of Survival Data Analysis", Statistical Methods for Survival Data Analysis, Wiley Series in Probability and Statistics, Hoboken, NJ, USA: John Wiley & Sons, Inc., 2003-06-30, pp. 19–63, doi:10.1002/0471458546.ch3, ISBN 978-0-471-45854-8, retrieved 2021-06-03
  154. ^ "The machine learning community takes on the Higgs". Symmetry Magazine. July 15, 2014. Archived from the original on 16 April 2021. Retrieved 14 January 2015.
  155. ^ Nehme, Jean (September 29, 2016). "LTPP International Data Analysis Contest". Federal Highway Administration. Archived from the original on October 21, 2017. Retrieved October 22, 2017.
  156. ^ "Data.Gov:Long-Term Pavement Performance (LTPP)". May 26, 2016. Archived from the original on November 1, 2017. Retrieved November 10, 2017.

Bibliography

  • Adèr, Herman J. (2008a). "Chapter 14: Phases and initial steps in data analysis". In Adèr, Herman J.; Mellenbergh, Gideon J.; Hand, David J (eds.). Advising on research methods : a consultant's companion. Huizen, Netherlands: Johannes van Kessel Pub. pp. 333–356. ISBN 9789079418015. OCLC 905799857.
  • Adèr, Herman J. (2008b). "Chapter 15: The main analysis phase". In Adèr, Herman J.; Mellenbergh, Gideon J.; Hand, David J (eds.). Advising on research methods : a consultant's companion. Huizen, Netherlands: Johannes van Kessel Pub. pp. 357–386. ISBN 9789079418015. OCLC 905799857.
  • Tabachnick, B.G. & Fidell, L.S. (2007). Chapter 4: Cleaning up your act. Screening data prior to analysis. In B.G. Tabachnick & L.S. Fidell (Eds.), Using Multivariate Statistics, Fifth Edition (pp. 60–116). Boston: Pearson Education, Inc. / Allyn and Bacon.

Further reading

  • Adèr, H.J. & Mellenbergh, G.J. (with contributions by D.J. Hand) (2008). Advising on Research Methods: A Consultant's Companion. Huizen, the Netherlands: Johannes van Kessel Publishing. ISBN 978-90-79418-01-5
  • Chambers, John M.; Cleveland, William S.; Kleiner, Beat; Tukey, Paul A. (1983). Graphical Methods for Data Analysis, Wadsworth/Duxbury Press. ISBN 0-534-98052-X
  • Fandango, Armando (2017). Python Data Analysis, 2nd Edition. Packt Publishers. ISBN 978-1787127487
  • Juran, Joseph M.; Godfrey, A. Blanton (1999). Juran's Quality Handbook, 5th Edition. New York: McGraw Hill. ISBN 0-07-034003-X
  • Lewis-Beck, Michael S. (1995). Data Analysis: an Introduction, Sage Publications Inc, ISBN 0-8039-5772-6
  • NIST/SEMATECH (2008) Handbook of Statistical Methods,
  • Pyzdek, T, (2003). Quality Engineering Handbook, ISBN 0-8247-4614-7
  • Richard Veryard (1984). Pragmatic Data Analysis. Oxford : Blackwell Scientific Publications. ISBN 0-632-01311-7
  • Tabachnick, B.G.; Fidell, L.S. (2007). Using Multivariate Statistics, 5th Edition. Boston: Pearson Education, Inc. / Allyn and Bacon, ISBN 978-0-205-45938-4