<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
<channel>
<title>Ask Ghassem - Recent questions tagged statistics</title>
<link>https://ask.ghassem.com/tag/statistics</link>
<description>Powered by Question2Answer</description>
<item>
<title>Individual and group relative strength in a fixed pool of players: How to approach the problem?</title>
<link>https://ask.ghassem.com/751/individual-group-relative-strength-players-approach-problem</link>
<description>&lt;div&gt;I apologize in advance if my question sounds too basic to be worthy of anyone&#039;s time, but statistics are not part of my curriculum.&lt;/div&gt;

&lt;div&gt;
&lt;p&gt;I am developing a proof of concept of a web application modeling the contribution of individual soccer player with respect to the different teams they&#039;ve played with throughout their career. In particular, I am looking into a way of &lt;em&gt;ranking&lt;/em&gt; both individuals and groups of players as follows::&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;teammates relative strength&lt;/strong&gt;: the best/worst combinations of players when playing in the same team in the same matches;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;opponents relative strength&lt;/strong&gt;: the best/worst combinations of players when playing in opposite teams in the same matches, i.e. which tuples of teammates are the best/worst against which;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I must admit I don&#039;t quite know how to approach the problem (as I said I have no formal education in statistics or data science). I would be very grateful&amp;nbsp; if anyone could give me some directions. How should I frame this particular problem and what resources in statistics or machine learning (if indeed this is a task fit for machine learning, perhaps I am mistaken on this) would be appropriate to tackle it?&lt;/p&gt;

&lt;p&gt;I am eager to learn, so both practical examples or theoretical references (book chapters, online articles, etc) would be very welcome.&lt;/p&gt;

&lt;p&gt;Thanks in advance!&lt;/p&gt;
&lt;/div&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/751/individual-group-relative-strength-players-approach-problem</guid>
<pubDate>Tue, 29 Oct 2019 20:00:28 +0000</pubDate>
</item>
<item>
<title>Using aggregate data to generate observation-level data statistically sound?</title>
<link>https://ask.ghassem.com/644/using-aggregate-generate-observation-level-statistically</link>
<description>&lt;p&gt;Context: In the realm of Paid Search Marketing. Current reporting does not provide event level data only aggregate totals with different segments.&amp;nbsp; Want to compare distributions/test statistical significance of A/B test results.&amp;nbsp; Did not want to assume that data followed normal distribution or know STDEV&amp;nbsp;for data so came with this approach.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;My Question: I am going to use the average &quot;CPA&quot; or &quot;CTR&quot; for a date range, and generate an observation for each conversion based off the average for a time range.&amp;nbsp; Is this statistically sound way if I want to generate raw data? Would I have wonky distributions because of the multiple averages?&amp;nbsp; Just want a gutcheck if I&#039;m completely off base.&amp;nbsp;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;My Aggregate data looks like below:&lt;/p&gt;

&lt;table border=&quot;1&quot; cellpadding=&quot;1&quot; style=&quot;width:500px&quot;&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th scope=&quot;col&quot;&gt;Day&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;Cost&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;Acquisition&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;CPA or CTR&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp;1&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 40&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp;2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 75&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$25&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;Observation data I generate looks like below:&lt;/p&gt;

&lt;table border=&quot;1&quot; cellpadding=&quot;1&quot; style=&quot;width:500px&quot;&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th scope=&quot;col&quot;&gt;Day&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;Acquisition&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$25&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$25&amp;nbsp;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$25&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;I really appreciate your help with this question! An important project to me at work.&amp;nbsp;&amp;nbsp;&lt;/p&gt;</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/644/using-aggregate-generate-observation-level-statistically</guid>
<pubDate>Tue, 11 Jun 2019 22:04:01 +0000</pubDate>
</item>
<item>
<title>Looking for guidance on whether I have the necessary data to answer a Regression question</title>
<link>https://ask.ghassem.com/595/looking-guidance-whether-necessary-answer-regression-question</link>
<description>&lt;p&gt;Hi everyone.&lt;/p&gt;

&lt;p&gt;I&#039;m currently working on my final project for a Data Science degree and after a month of literature review, exploratory analysis and model testing,&amp;nbsp;I&#039;m not sure if the questions I set out to answer are suitable for&amp;nbsp;the data I have.&lt;/p&gt;

&lt;p&gt;This is a very broad question I&#039;m asking here, as it&#039;s more guidance than anything else, so if this is not the place to ask, I would appreciate it if you could redirect me to the right place.&lt;/p&gt;

&lt;p&gt;You can find the data sets and code on my github &lt;a rel=&quot;nofollow&quot; href=&quot;https://github.com/TomGoncalves/IAQ-Project&quot;&gt;here&lt;/a&gt;.&amp;nbsp;The code is messy but working; I&#039;ve only picked up programming last year.&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;h3&gt;&lt;strong&gt;The data&lt;/strong&gt;&lt;/h3&gt;

&lt;p&gt;Indoor Air Quality data recorded hourly through 4 sensors (Kitchen, Bedroom, Living Room, Bathroom) for 7 days in a house for a total of 3 houses. For 6 of those days, each sensor was in a different room and on the last one, all sensors were together so we could see how spread apart their signals were and account for that). So in here&amp;nbsp;I have 9 continuous variables: Temperature, Relative Humidity, CO, CO2, TVOC, PM2.5, NO2, Ozone and Air Pressure.&lt;/p&gt;

&lt;p&gt;I then got 3 manually-filled questionnaires on Occupant Activity, one for each house, such as &quot;Door open/closed&quot;, &quot;Window open/closed&quot;, &quot;Heating On/off&quot;, &quot;Frying&quot;, &quot;Boiling&quot;, &quot;Hoovering&quot;, &quot;Mopping&quot;, etc. Now, these logs were missing a lot of data.&lt;/p&gt;

&lt;p&gt;These questionnaires were a mess and a lot of the missing values had to be imputed. This data is reported in binary format such as &quot;Did Activity X occur at hour Y? - Yes(1)/No(1).&lt;/p&gt;

&lt;p&gt;With this project I&#039;ve chosen to&amp;nbsp;predict a sensor data variable (in this case CO2), based on activities.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;h3&gt;&lt;strong&gt;Models&lt;/strong&gt;&lt;/h3&gt;

&lt;p&gt;Just to have a feel for the data, I&amp;nbsp;ran a Linear Regression, Decision Tree and Random Forest model with a choice of only Occupant Activity predictors and both Occupant Activity and other sensor variables as predictors on individual rooms of each house and the results are just atrocious in every case. Cross-validation shows the model&#039;s performance to be all over the place and looking at features for statistical significance gives me different significant features in every room of every house, it&#039;s like I&#039;m playing feature roulette. Problem with some features such as Mopping, Frying, Boiling, Hoovering is that there will be a lot of &quot;0&quot;s in comparison to &quot;1&quot;s due to the nature of the feature, so one or two &quot;1&quot;s in the wrong place is enough to give a misguided correlation.&lt;/p&gt;

&lt;p&gt;As you can tell and see from this, I&#039;m still a Data Scientist in-training here, having only done a few models in the past and rather new-ish to programming (1 year experience).&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;h3&gt;&lt;strong&gt;What I&#039;m looking for&lt;/strong&gt;&lt;/h3&gt;

&lt;p&gt;I suppose that more than anything, I&#039;m asking for guidance on whether pursuing this as a Regression problem is feasible or not.&lt;/p&gt;

&lt;p&gt;I&#039;m very short on time but if this won&#039;t work, I can look into alternatives. For instance,&amp;nbsp;Air Pollutants have safety thresholds. I could create a class feature on whether the value is over the threshold or not and turn it into a classification problem or even a cluster one to identify the room based on activities and air pollutants..&lt;/p&gt;

&lt;p&gt;Bottom-line is that I have a 12,500 word paper to deliver in a month, I&#039;ve been at this for month already with nothing to show for, so I&#039;m hoping someone with more experience under their belt could see if I&#039;m chasing a dead end.&amp;nbsp;Any help in the form of guidance would be so very much appreciated, I&#039;ve ran out of ideas here.&lt;/p&gt;

&lt;p&gt;Thanks,&lt;/p&gt;

&lt;p&gt;Tom&lt;/p&gt;</description>
<category>Machine Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/595/looking-guidance-whether-necessary-answer-regression-question</guid>
<pubDate>Sun, 24 Mar 2019 23:40:09 +0000</pubDate>
</item>
<item>
<title>How do I know when it is appropriate to use stratified sampling?</title>
<link>https://ask.ghassem.com/568/how-do-know-when-it-is-appropriate-to-use-stratified-sampling</link>
<description></description>
<category>Machine Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/568/how-do-know-when-it-is-appropriate-to-use-stratified-sampling</guid>
<pubDate>Tue, 19 Feb 2019 18:52:48 +0000</pubDate>
</item>
<item>
<title>What is the easiest way to distinguish whether to use a z value test or a t value test?</title>
<link>https://ask.ghassem.com/553/what-the-easiest-distinguish-whether-value-test-value-test</link>
<description></description>
<category>Data Science</category>
<guid isPermaLink="true">https://ask.ghassem.com/553/what-the-easiest-distinguish-whether-value-test-value-test</guid>
<pubDate>Tue, 18 Dec 2018 04:33:27 +0000</pubDate>
</item>
<item>
<title>nonpooled independent samples t-interval method</title>
<link>https://ask.ghassem.com/548/nonpooled-independent-samples-t-interval-method</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/548/nonpooled-independent-samples-t-interval-method</guid>
<pubDate>Fri, 07 Dec 2018 23:06:04 +0000</pubDate>
</item>
<item>
<title>Computing the Confidence Interval for a Difference Between Two Means</title>
<link>https://ask.ghassem.com/546/computing-the-confidence-interval-difference-between-means</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/546/computing-the-confidence-interval-difference-between-means</guid>
<pubDate>Fri, 07 Dec 2018 23:02:45 +0000</pubDate>
</item>
<item>
<title>How to find the strength of a P-value against a null hypothesis?</title>
<link>https://ask.ghassem.com/545/how-to-find-the-strength-of-a-p-value-against-null-hypothesis</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/545/how-to-find-the-strength-of-a-p-value-against-null-hypothesis</guid>
<pubDate>Fri, 07 Dec 2018 22:31:33 +0000</pubDate>
</item>
<item>
<title>What is degree of Freedom while calculating confidence interval?</title>
<link>https://ask.ghassem.com/544/what-degree-freedom-while-calculating-confidence-interval</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/544/what-degree-freedom-while-calculating-confidence-interval</guid>
<pubDate>Fri, 07 Dec 2018 22:27:27 +0000</pubDate>
</item>
<item>
<title>How do you know when to use T-Distribution instead of regular Confidence Interval?</title>
<link>https://ask.ghassem.com/537/know-when-distribution-instead-regular-confidence-interval</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/537/know-when-distribution-instead-regular-confidence-interval</guid>
<pubDate>Wed, 05 Dec 2018 22:50:48 +0000</pubDate>
</item>
<item>
<title>What is the T Distribution?</title>
<link>https://ask.ghassem.com/534/what-is-the-t-distribution</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/534/what-is-the-t-distribution</guid>
<pubDate>Fri, 30 Nov 2018 21:22:02 +0000</pubDate>
</item>
<item>
<title>Probability for the data items within mean +- 1, +-2, +-3 standard deviation?</title>
<link>https://ask.ghassem.com/529/probability-for-data-items-within-mean-standard-deviation</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/529/probability-for-data-items-within-mean-standard-deviation</guid>
<pubDate>Fri, 30 Nov 2018 21:15:35 +0000</pubDate>
</item>
<item>
<title>What is &#039;Degrees of Freedom&#039;?</title>
<link>https://ask.ghassem.com/527/what-is-degrees-of-freedom</link>
<description>What is &amp;#039;Degrees of Freedom&amp;#039;?</description>
<category>Data Science</category>
<guid isPermaLink="true">https://ask.ghassem.com/527/what-is-degrees-of-freedom</guid>
<pubDate>Thu, 29 Nov 2018 18:49:06 +0000</pubDate>
</item>
<item>
<title>How do I calculate a and z a/2?</title>
<link>https://ask.ghassem.com/525/how-do-i-calculate-a-and-z-a-2</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/525/how-do-i-calculate-a-and-z-a-2</guid>
<pubDate>Thu, 29 Nov 2018 18:19:23 +0000</pubDate>
</item>
<item>
<title>What is the rule that determines percentages with the mean +- 1,2 or 3 standard deviations?</title>
<link>https://ask.ghassem.com/523/what-rule-that-determines-percentages-standard-deviations</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/523/what-rule-that-determines-percentages-standard-deviations</guid>
<pubDate>Sat, 17 Nov 2018 19:03:35 +0000</pubDate>
</item>
<item>
<title>What will be probability for the data items within mean +- 1, +-2, +-3 standard deviation?</title>
<link>https://ask.ghassem.com/520/what-will-probability-data-items-within-standard-deviation</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/520/what-will-probability-data-items-within-standard-deviation</guid>
<pubDate>Wed, 14 Nov 2018 15:10:15 +0000</pubDate>
</item>
<item>
<title>How much percentage of data items are consumed within mean +-1 , +-2, +-3 standard deviation?</title>
<link>https://ask.ghassem.com/518/much-percentage-items-consumed-within-standard-deviation</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/518/much-percentage-items-consumed-within-standard-deviation</guid>
<pubDate>Wed, 14 Nov 2018 15:04:52 +0000</pubDate>
</item>
<item>
<title>What is a test statistic in hypothesis testing and how does it relate to the p-value?</title>
<link>https://ask.ghassem.com/516/what-test-statistic-hypothesis-testing-and-does-relate-value</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/516/what-test-statistic-hypothesis-testing-and-does-relate-value</guid>
<pubDate>Tue, 13 Nov 2018 00:51:11 +0000</pubDate>
</item>
<item>
<title>What is sampling error?</title>
<link>https://ask.ghassem.com/514/what-is-sampling-error</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/514/what-is-sampling-error</guid>
<pubDate>Tue, 13 Nov 2018 00:47:51 +0000</pubDate>
</item>
<item>
<title>What is P-value in Statistical Testing?</title>
<link>https://ask.ghassem.com/510/what-is-p-value-in-statistical-testing</link>
<description>What is P-value in Statistical testing?</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/510/what-is-p-value-in-statistical-testing</guid>
<pubDate>Mon, 12 Nov 2018 13:41:39 +0000</pubDate>
</item>
<item>
<title>What is a sampling distribution of sample means?</title>
<link>https://ask.ghassem.com/508/what-is-a-sampling-distribution-of-sample-means</link>
<description>What is a sampling distribution of sample means?</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/508/what-is-a-sampling-distribution-of-sample-means</guid>
<pubDate>Mon, 12 Nov 2018 13:38:30 +0000</pubDate>
</item>
<item>
<title>In the first week of November 2003.... What is the expected number of homicides per week in the GTA?</title>
<link>https://ask.ghassem.com/503/first-week-november-2003-what-expected-number-homicides-week</link>
<description>This is from question one from lecture 4B&amp;#039;s exercises: &amp;quot;In the first week of November 2003, five unrelated homicides were reported in the GTA. The GTA usually has about 78 homicides per year. What is the expected number of homicides per week in the GTA?&amp;quot;</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/503/first-week-november-2003-what-expected-number-homicides-week</guid>
<pubDate>Fri, 02 Nov 2018 19:34:40 +0000</pubDate>
</item>
<item>
<title>What is summary statistics?</title>
<link>https://ask.ghassem.com/500/what-is-summary-statistics</link>
<description></description>
<category>Data Science Interview Questions</category>
<guid isPermaLink="true">https://ask.ghassem.com/500/what-is-summary-statistics</guid>
<pubDate>Thu, 01 Nov 2018 19:45:16 +0000</pubDate>
</item>
<item>
<title>What is the difference between univariate and multivariate analysis?</title>
<link>https://ask.ghassem.com/491/what-difference-between-univariate-multivariate-analysis</link>
<description></description>
<category>Data Science Interview Questions</category>
<guid isPermaLink="true">https://ask.ghassem.com/491/what-difference-between-univariate-multivariate-analysis</guid>
<pubDate>Tue, 30 Oct 2018 11:39:08 +0000</pubDate>
</item>
<item>
<title>What is the purpose of randomization in statistics?</title>
<link>https://ask.ghassem.com/486/what-is-the-purpose-of-randomization-in-statistics</link>
<description></description>
<category>Data Science Interview Questions</category>
<guid isPermaLink="true">https://ask.ghassem.com/486/what-is-the-purpose-of-randomization-in-statistics</guid>
<pubDate>Tue, 30 Oct 2018 11:35:52 +0000</pubDate>
</item>
<item>
<title>In statistics, whats the difference between contingency and frequency tables?</title>
<link>https://ask.ghassem.com/483/statistics-difference-between-contingency-frequency-tables</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/483/statistics-difference-between-contingency-frequency-tables</guid>
<pubDate>Tue, 30 Oct 2018 11:19:48 +0000</pubDate>
</item>
<item>
<title>Which scenarios among the following are a valid reason to use regularization?</title>
<link>https://ask.ghassem.com/451/which-scenarios-among-following-valid-reason-regularization</link>
<description>A. To drop the least useful variables of a model&lt;br /&gt;
&lt;br /&gt;
B. To reduce over-fitting&lt;br /&gt;
&lt;br /&gt;
C. To reduce the bias of a model&lt;br /&gt;
&lt;br /&gt;
D. To decrease p-value</description>
<category>Data Science Interview Questions</category>
<guid isPermaLink="true">https://ask.ghassem.com/451/which-scenarios-among-following-valid-reason-regularization</guid>
<pubDate>Sat, 27 Oct 2018 17:31:43 +0000</pubDate>
</item>
<item>
<title>How do you tell a permutation problem from a combination problem?</title>
<link>https://ask.ghassem.com/396/how-you-tell-permutation-problem-from-combination-problem</link>
<description>Given the questions:&lt;br /&gt;
&lt;br /&gt;
1. How many different groups of 4 students can be made from a class of 40?&lt;br /&gt;
&lt;br /&gt;
and&lt;br /&gt;
&lt;br /&gt;
2. How many different lists of 4 students can be made from a class of 40?&lt;br /&gt;
&lt;br /&gt;
How do you determine which requires a permutation and which requires a combination?</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/396/how-you-tell-permutation-problem-from-combination-problem</guid>
<pubDate>Mon, 15 Oct 2018 15:34:13 +0000</pubDate>
</item>
<item>
<title>What is the use of the Poisson Distribution?</title>
<link>https://ask.ghassem.com/376/what-is-the-use-of-the-poisson-distribution</link>
<description>Also, when would one use this distribution ?</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/376/what-is-the-use-of-the-poisson-distribution</guid>
<pubDate>Sat, 13 Oct 2018 15:37:29 +0000</pubDate>
</item>
<item>
<title>How do we determine if two events are independent</title>
<link>https://ask.ghassem.com/374/how-do-we-determine-if-two-events-are-independent</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/374/how-do-we-determine-if-two-events-are-independent</guid>
<pubDate>Sat, 13 Oct 2018 15:23:39 +0000</pubDate>
</item>
<item>
<title>In the context of probability, what is the &quot;event space&quot;</title>
<link>https://ask.ghassem.com/372/in-the-context-of-probability-what-is-the-event-space</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/372/in-the-context-of-probability-what-is-the-event-space</guid>
<pubDate>Sat, 13 Oct 2018 15:10:28 +0000</pubDate>
</item>
<item>
<title>What is the &quot;Three-Standard-Deviations Rule&quot;</title>
<link>https://ask.ghassem.com/370/what-is-the-three-standard-deviations-rule</link>
<description>This rule is also known as the &amp;quot;68–95–99.7 rule&amp;quot;. &amp;nbsp;What does this rule stand for and how does it help me?</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/370/what-is-the-three-standard-deviations-rule</guid>
<pubDate>Sat, 13 Oct 2018 14:57:53 +0000</pubDate>
</item>
<item>
<title>How do histograms help us understand a data set?</title>
<link>https://ask.ghassem.com/367/how-do-histograms-help-us-understand-a-data-set</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/367/how-do-histograms-help-us-understand-a-data-set</guid>
<pubDate>Sat, 13 Oct 2018 14:42:11 +0000</pubDate>
</item>
<item>
<title>What is a discrete random variable?</title>
<link>https://ask.ghassem.com/365/what-is-a-discrete-random-variable</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/365/what-is-a-discrete-random-variable</guid>
<pubDate>Sat, 13 Oct 2018 01:23:24 +0000</pubDate>
</item>
<item>
<title>What is the difference between permutation and combination?</title>
<link>https://ask.ghassem.com/363/what-is-the-difference-between-permutation-and-combination</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/363/what-is-the-difference-between-permutation-and-combination</guid>
<pubDate>Sat, 13 Oct 2018 01:19:28 +0000</pubDate>
</item>
<item>
<title>What is the difference between Qualitative and Quantitative data?</title>
<link>https://ask.ghassem.com/361/what-the-difference-between-qualitative-quantitative-data</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/361/what-the-difference-between-qualitative-quantitative-data</guid>
<pubDate>Sat, 13 Oct 2018 01:15:59 +0000</pubDate>
</item>
<item>
<title>If x is the number that comes up when you roll a 20 sided die. What is the expected value of x?</title>
<link>https://ask.ghassem.com/360/the-number-that-comes-when-you-roll-sided-what-expected-value</link>
<description>If x is the number that comes up when you roll a 20 sided die. What is the expected value of x?</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/360/the-number-that-comes-when-you-roll-sided-what-expected-value</guid>
<pubDate>Fri, 12 Oct 2018 22:43:05 +0000</pubDate>
</item>
<item>
<title>What is the probability of getting a King of Hearts?</title>
<link>https://ask.ghassem.com/359/what-is-the-probability-of-getting-a-king-of-hearts</link>
<description>What is the probability of picking a King of Hearts in a standard 52 card deck?</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/359/what-is-the-probability-of-getting-a-king-of-hearts</guid>
<pubDate>Fri, 12 Oct 2018 22:31:45 +0000</pubDate>
</item>
<item>
<title>Define measures of center (Median and Mode) ?</title>
<link>https://ask.ghassem.com/357/define-measures-of-center-median-and-mode</link>
<description>Define the 2 measures of centers.</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/357/define-measures-of-center-median-and-mode</guid>
<pubDate>Fri, 12 Oct 2018 22:28:59 +0000</pubDate>
</item>
<item>
<title>What is &quot;Random Sampling&quot;?</title>
<link>https://ask.ghassem.com/356/what-is-random-sampling</link>
<description>Define random sampling</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/356/what-is-random-sampling</guid>
<pubDate>Fri, 12 Oct 2018 22:26:06 +0000</pubDate>
</item>
<item>
<title>What is binomial experiment and when do we use it?</title>
<link>https://ask.ghassem.com/355/what-is-binomial-experiment-and-when-do-we-use-it</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/355/what-is-binomial-experiment-and-when-do-we-use-it</guid>
<pubDate>Fri, 12 Oct 2018 22:25:59 +0000</pubDate>
</item>
<item>
<title>When should we use permutation and combination?</title>
<link>https://ask.ghassem.com/353/when-should-we-use-permutation-and-combination</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/353/when-should-we-use-permutation-and-combination</guid>
<pubDate>Fri, 12 Oct 2018 22:16:45 +0000</pubDate>
</item>
<item>
<title>Calculate IQR(Inter Quartile Range) of {18, 24, 19, 16, 21}?</title>
<link>https://ask.ghassem.com/351/calculate-iqr-inter-quartile-range-of-18-24-19-16-21</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/351/calculate-iqr-inter-quartile-range-of-18-24-19-16-21</guid>
<pubDate>Fri, 12 Oct 2018 22:07:58 +0000</pubDate>
</item>
<item>
<title>What do population, parameter, census and sample mean in statistics?</title>
<link>https://ask.ghassem.com/349/what-population-parameter-census-and-sample-mean-statistics</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/349/what-population-parameter-census-and-sample-mean-statistics</guid>
<pubDate>Fri, 12 Oct 2018 21:50:34 +0000</pubDate>
</item>
<item>
<title>Question for Discrete random variable</title>
<link>https://ask.ghassem.com/347/question-for-discrete-random-variable</link>
<description>An investor has a 0.60 probability of making a 20000 dollar profit and a 0.40 probability of suffering a 25000 dollar loss. What is the expected value?</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/347/question-for-discrete-random-variable</guid>
<pubDate>Fri, 12 Oct 2018 21:00:40 +0000</pubDate>
</item>
<item>
<title>Sample question of permutation and combination.</title>
<link>https://ask.ghassem.com/345/sample-question-of-permutation-and-combination</link>
<description>You can bring along 2 friends! you have 5 friends who want to come along. How many different groups of friends could you take with you?</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/345/sample-question-of-permutation-and-combination</guid>
<pubDate>Fri, 12 Oct 2018 20:47:03 +0000</pubDate>
</item>
<item>
<title>{1.75,  1.63,  1.55,  1.92,  1.81, 1.79, 1.81}. Determine the mean, the median, and the 20th percentile?</title>
<link>https://ask.ghassem.com/343/75-63-55-determine-the-mean-the-median-and-the-20th-percentile</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/343/75-63-55-determine-the-mean-the-median-and-the-20th-percentile</guid>
<pubDate>Fri, 12 Oct 2018 19:42:32 +0000</pubDate>
</item>
<item>
<title>What is  difference between discrete numerical variable and continuous numerical variable?</title>
<link>https://ask.ghassem.com/341/difference-discrete-numerical-variable-continuous-numerical</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/341/difference-discrete-numerical-variable-continuous-numerical</guid>
<pubDate>Fri, 12 Oct 2018 19:25:11 +0000</pubDate>
</item>
<item>
<title>What is the mean and standard deviation of a standard normal distribution?</title>
<link>https://ask.ghassem.com/330/what-mean-standard-deviation-standard-normal-distribution</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/330/what-mean-standard-deviation-standard-normal-distribution</guid>
<pubDate>Fri, 12 Oct 2018 01:39:12 +0000</pubDate>
</item>
<item>
<title>How to calculate normal distribution?</title>
<link>https://ask.ghassem.com/532/how-to-calculate-normal-distribution</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/532/how-to-calculate-normal-distribution</guid>
<pubDate>Sat, 01 Sep 2018 20:20:01 +0000</pubDate>
</item>
</channel>
</rss>