<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
<channel>
<title>Ask Ghassem - Recent activity in Statistics</title>
<link>https://ask.ghassem.com/activity/data-science/statistics</link>
<description>Powered by Question2Answer</description>
<item>
<title>How do I compare the count of a value in each year while having a different sanple size each year.</title>
<link>https://ask.ghassem.com/1025/compare-count-value-each-year-while-having-different-sanple</link>
<description>How do I accurately compare between the number of something a survey measure from my employees each year with a varying umber of survey engagement and employee size?&lt;br /&gt;
&lt;br /&gt;
If I was measuring the satisfaction of my employees over the years by collecting a survey from my them each year by asking them wether they are satisfied or not, and then comparing yes’s over the years but the number of employees who answer is not the same each year and the number of employees increases every year. How do I correctly compare this throughout each year?&lt;br /&gt;
&lt;br /&gt;
In other words, how do I remove the effect of the survey engagement rate when calculating the results?</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/1025/compare-count-value-each-year-while-having-different-sanple</guid>
<pubDate>Wed, 08 Jun 2022 10:32:33 +0000</pubDate>
</item>
<item>
<title>is it possible to derive a new 95% CI from two separate 95% CIs?</title>
<link>https://ask.ghassem.com/961/is-it-possible-to-derive-a-new-95-ci-from-two-separate-95-cis</link>
<description>&lt;div id=&quot;i4c-draggable-container&quot; style=&quot;position: fixed; z-index: 1499; width: 0px; height: 0px;&quot;&gt;
&lt;div class=&quot;resolved&quot; data-reactroot=&quot;&quot; style=&quot;all: initial;&quot;&gt;&amp;nbsp;&lt;/div&gt;
&lt;/div&gt;

&lt;div style=&quot;position: fixed; z-index: 1499; width: 0px; height: 0px;&quot;&gt;
&lt;div style=&quot;all: initial;&quot;&gt;&amp;nbsp;&lt;/div&gt;
&lt;/div&gt;

&lt;div style=&quot;position: fixed; z-index: 1499; width: 0px; height: 0px;&quot;&gt;
&lt;div style=&quot;all: initial;&quot;&gt;&amp;nbsp;&lt;/div&gt;
&lt;/div&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;div&gt;&amp;nbsp;&lt;/div&gt;

&lt;div&gt;&amp;nbsp;&lt;/div&gt;

&lt;div id=&quot;i4c-dialogs-container&quot;&gt;&amp;nbsp;&lt;/div&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/961/is-it-possible-to-derive-a-new-95-ci-from-two-separate-95-cis</guid>
<pubDate>Mon, 23 Nov 2020 14:45:19 +0000</pubDate>
</item>
<item>
<title>Individual and group relative strength in a fixed pool of players: How to approach the problem?</title>
<link>https://ask.ghassem.com/751/individual-group-relative-strength-players-approach-problem</link>
<description>&lt;div&gt;I apologize in advance if my question sounds too basic to be worthy of anyone&#039;s time, but statistics are not part of my curriculum.&lt;/div&gt;

&lt;div&gt;
&lt;p&gt;I am developing a proof of concept of a web application modeling the contribution of individual soccer player with respect to the different teams they&#039;ve played with throughout their career. In particular, I am looking into a way of &lt;em&gt;ranking&lt;/em&gt; both individuals and groups of players as follows::&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;teammates relative strength&lt;/strong&gt;: the best/worst combinations of players when playing in the same team in the same matches;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;opponents relative strength&lt;/strong&gt;: the best/worst combinations of players when playing in opposite teams in the same matches, i.e. which tuples of teammates are the best/worst against which;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I must admit I don&#039;t quite know how to approach the problem (as I said I have no formal education in statistics or data science). I would be very grateful&amp;nbsp; if anyone could give me some directions. How should I frame this particular problem and what resources in statistics or machine learning (if indeed this is a task fit for machine learning, perhaps I am mistaken on this) would be appropriate to tackle it?&lt;/p&gt;

&lt;p&gt;I am eager to learn, so both practical examples or theoretical references (book chapters, online articles, etc) would be very welcome.&lt;/p&gt;

&lt;p&gt;Thanks in advance!&lt;/p&gt;
&lt;/div&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/751/individual-group-relative-strength-players-approach-problem</guid>
<pubDate>Tue, 29 Oct 2019 20:00:28 +0000</pubDate>
</item>
<item>
<title>Using aggregate data to generate observation-level data statistically sound?</title>
<link>https://ask.ghassem.com/644/using-aggregate-generate-observation-level-statistically</link>
<description>&lt;p&gt;Context: In the realm of Paid Search Marketing. Current reporting does not provide event level data only aggregate totals with different segments.&amp;nbsp; Want to compare distributions/test statistical significance of A/B test results.&amp;nbsp; Did not want to assume that data followed normal distribution or know STDEV&amp;nbsp;for data so came with this approach.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;My Question: I am going to use the average &quot;CPA&quot; or &quot;CTR&quot; for a date range, and generate an observation for each conversion based off the average for a time range.&amp;nbsp; Is this statistically sound way if I want to generate raw data? Would I have wonky distributions because of the multiple averages?&amp;nbsp; Just want a gutcheck if I&#039;m completely off base.&amp;nbsp;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;My Aggregate data looks like below:&lt;/p&gt;

&lt;table border=&quot;1&quot; cellpadding=&quot;1&quot; style=&quot;width:500px&quot;&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th scope=&quot;col&quot;&gt;Day&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;Cost&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;Acquisition&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;CPA or CTR&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp;1&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 40&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp;2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 75&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$25&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;Observation data I generate looks like below:&lt;/p&gt;

&lt;table border=&quot;1&quot; cellpadding=&quot;1&quot; style=&quot;width:500px&quot;&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th scope=&quot;col&quot;&gt;Day&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;Acquisition&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$25&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$25&amp;nbsp;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$25&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;I really appreciate your help with this question! An important project to me at work.&amp;nbsp;&amp;nbsp;&lt;/p&gt;</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/644/using-aggregate-generate-observation-level-statistically</guid>
<pubDate>Tue, 11 Jun 2019 22:04:01 +0000</pubDate>
</item>
<item>
<title>Answered: What is degree of Freedom while calculating confidence interval?</title>
<link>https://ask.ghassem.com/544/what-degree-freedom-while-calculating-confidence-interval?show=558#a558</link>
<description>&lt;p&gt;Degree of Freedom is the number of values that are free to vary in the computation of a statistic. For more information please take a look at &lt;a rel=&quot;nofollow&quot; href=&quot;http://www.jerrydallal.com/LHSP/dof.htm&quot;&gt;this article&lt;/a&gt;.&amp;nbsp;&lt;/p&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/544/what-degree-freedom-while-calculating-confidence-interval?show=558#a558</guid>
<pubDate>Fri, 28 Dec 2018 16:06:55 +0000</pubDate>
</item>
<item>
<title>Answered: How to find the strength of a P-value against a null hypothesis?</title>
<link>https://ask.ghassem.com/545/how-to-find-the-strength-of-a-p-value-against-null-hypothesis?show=556#a556</link>
<description>P &amp;gt; 0.10 &amp;nbsp;&amp;nbsp;--------------- &amp;nbsp;&amp;nbsp;Weak or None&lt;br /&gt;
&lt;br /&gt;
0.05 &amp;lt; P &amp;lt;= 0.10 --------------- &amp;nbsp;Moderate&lt;br /&gt;
&lt;br /&gt;
0.01 &amp;lt; P &amp;lt;= 0.05 ---------------- &amp;nbsp;Strong&lt;br /&gt;
&lt;br /&gt;
P &amp;lt;= 0.01 ---------------------- Very Strong</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/545/how-to-find-the-strength-of-a-p-value-against-null-hypothesis?show=556#a556</guid>
<pubDate>Fri, 28 Dec 2018 16:01:36 +0000</pubDate>
</item>
<item>
<title>Answered: What are the Gaussian equation parameters?</title>
<link>https://ask.ghassem.com/11/what-are-the-gaussian-equation-parameters?show=555#a555</link>
<description>&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;the parameters ( &lt;em&gt;σ&lt;/em&gt; and &lt;em&gt;μ&lt;/em&gt; ) of the Gaussian equation&lt;br&gt;
&lt;br&gt;
&lt;em&gt;g&lt;/em&gt;(&lt;em&gt;x&lt;/em&gt;)=1&lt;em&gt;σ&lt;/em&gt;2&lt;em&gt;π&lt;/em&gt;√&lt;em&gt;e&lt;/em&gt;−12(&lt;em&gt;x&lt;/em&gt;−&lt;em&gt;μ&lt;/em&gt;&lt;em&gt;σ&lt;/em&gt;)2&lt;/p&gt;

&lt;p&gt;&lt;em&gt;σ&lt;/em&gt; is the standard deviation&lt;br&gt;
&lt;em&gt;μ &lt;/em&gt;is the mean&lt;/p&gt;

&lt;div&gt;&amp;nbsp;&lt;/div&gt;

&lt;div&gt;
&lt;div&gt;
&lt;div&gt;&amp;nbsp;&lt;/div&gt;
&lt;/div&gt;

&lt;div&gt;&amp;nbsp;&lt;/div&gt;
&lt;/div&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/11/what-are-the-gaussian-equation-parameters?show=555#a555</guid>
<pubDate>Tue, 18 Dec 2018 04:43:54 +0000</pubDate>
</item>
<item>
<title>Answered: What is the use of following equation?</title>
<link>https://ask.ghassem.com/15/what-is-the-use-of-following-equation?show=554#a554</link>
<description>&lt;p&gt;used to find the hypothesis of a triangle. it is also called the Pythagorean Theorem&lt;/p&gt;

&lt;p&gt;If &lt;em&gt;c&lt;/em&gt; denotes the &lt;a rel=&quot;nofollow&quot; href=&quot;https://en.wikipedia.org/wiki/Length&quot;&gt;length&lt;/a&gt; of the hypotenuse and &lt;em&gt;a&lt;/em&gt; and &lt;em&gt;b&lt;/em&gt; denote the lengths of the other two sides, the Pythagorean theorem can be expressed as the Pythagorean equation:&lt;/p&gt;

&lt;p&gt;a 2 + b 2 = c 2 . {\displaystyle a^{2}+b^{2}=c^{2}.} &lt;img alt=&quot;{\displaystyle a^{2}+b^{2}=c^{2}.}&quot; src=&quot;https://wikimedia.org/api/rest_v1/media/math/render/svg/90b56b985c78deb115014efe90ce634d73dd51fa&quot;&gt;&lt;/p&gt;

&lt;p&gt;If the length of both &lt;em&gt;a&lt;/em&gt; and &lt;em&gt;b&lt;/em&gt; are known, then &lt;em&gt;c&lt;/em&gt; can be calculated as&lt;/p&gt;

&lt;p&gt;c = a 2 + b 2 . {\displaystyle c={\sqrt {a^{2}+b^{2}}}.} &lt;img alt=&quot;{\displaystyle c={\sqrt {a^{2}+b^{2}}}.}&quot; src=&quot;https://wikimedia.org/api/rest_v1/media/math/render/svg/5fd521cee81d583ce94bf6710984cc2a9eb7c3da&quot;&gt;&lt;/p&gt;

&lt;p&gt;If the length of the hypotenuse &lt;em&gt;c&lt;/em&gt; and of one side (&lt;em&gt;a&lt;/em&gt; or &lt;em&gt;b&lt;/em&gt;) are known, then the length of the other side can be calculated as&lt;/p&gt;

&lt;p&gt;a = c 2 − b 2 {\displaystyle a={\sqrt {c^{2}-b^{2}}}} &lt;img alt=&quot;{\displaystyle a={\sqrt {c^{2}-b^{2}}}}&quot; src=&quot;https://wikimedia.org/api/rest_v1/media/math/render/svg/ca09e8ee119ce95c893ceb2ac1f7ccd3a40fad9c&quot;&gt;&lt;/p&gt;

&lt;p&gt;or&lt;/p&gt;

&lt;p&gt;b = c 2 − a 2 . {\displaystyle b={\sqrt {c^{2}-a^{2}}}.} &lt;img alt=&quot;{\displaystyle b={\sqrt {c^{2}-a^{2}}}.}&quot; src=&quot;https://wikimedia.org/api/rest_v1/media/math/render/svg/77e9dfccdb8a659eeb36f79af461f207265d5911&quot;&gt;&lt;/p&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/15/what-is-the-use-of-following-equation?show=554#a554</guid>
<pubDate>Tue, 18 Dec 2018 04:42:20 +0000</pubDate>
</item>
<item>
<title>Answered: nonpooled independent samples t-interval method</title>
<link>https://ask.ghassem.com/548/nonpooled-independent-samples-t-interval-method?show=549#a549</link>
<description>The test statistic is : $t=\frac{\bar{x}_1-\bar{x}_2}{s_p \sqrt{\frac{1}{n_1}+\frac{1}{n_2}}}$&lt;br /&gt;
&lt;br /&gt;
The degrees of freedom are found using a complicated approximation formula. You won’t have to do that calculation &amp;quot;by hand&amp;quot;, but is done by: $DF=\frac{(\frac{s_1^2}{n_1}+\frac{s_2^2}{n_2})^2}{\frac{1}{n_1-1} (\frac{s_1^2}{n_1})^2 + \frac{1}{n_2-1} (\frac{s_2^2}{n_2})^2}$</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/548/nonpooled-independent-samples-t-interval-method?show=549#a549</guid>
<pubDate>Fri, 07 Dec 2018 23:10:28 +0000</pubDate>
</item>
<item>
<title>Answered: Computing the Confidence Interval for a Difference Between Two Means</title>
<link>https://ask.ghassem.com/546/computing-the-confidence-interval-difference-between-means?show=547#a547</link>
<description>&lt;p&gt;If the sample sizes are larger, that is both n&lt;sub&gt;1&lt;/sub&gt;&amp;nbsp;and n&lt;sub&gt;2&lt;/sub&gt;&amp;nbsp;are greater than 30, then one uses the z-table.&lt;/p&gt;

&lt;p&gt;If either sample size is less than 30, then the t-table is used.&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;If n&lt;sub&gt;1&lt;/sub&gt;&amp;nbsp;&amp;gt; 30 and n&lt;sub&gt;2&lt;/sub&gt;&amp;nbsp;&amp;gt; 30, we can use the z-table:&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;img alt=&quot;&quot; src=&quot;http://sphweb.bumc.bu.edu/otlt/MPH-Modules/BS/BS704_Confidence_Intervals/ada-reference.gif&quot;&gt;&lt;img alt=&quot;&quot; src=&quot;http://sphweb.bumc.bu.edu/otlt/MPH-Modules/BS/BS704_Confidence_Intervals/lessonimages/equation_image134.gif&quot;&gt;&lt;/p&gt;

&lt;p&gt;Use Z table for standard normal distribution&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;If n&lt;sub&gt;1&lt;/sub&gt;&amp;nbsp;&amp;lt; 30 or n&lt;sub&gt;2&lt;/sub&gt;&amp;nbsp;&amp;lt; 30, use the t-table:\&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;&lt;img alt=&quot;&quot; src=&quot;http://sphweb.bumc.bu.edu/otlt/MPH-Modules/BS/BS704_Confidence_Intervals/ada-reference.gif&quot;&gt;&lt;img alt=&quot;&quot; src=&quot;http://sphweb.bumc.bu.edu/otlt/MPH-Modules/BS/BS704_Confidence_Intervals/lessonimages/equation_image135.gif&quot;&gt;&lt;/p&gt;

&lt;p&gt;Use the t-table with degrees of freedom = n&lt;sub&gt;1&lt;/sub&gt;+n&lt;sub&gt;2&lt;/sub&gt;-2&lt;/p&gt;

&lt;p&gt;For both large and small samples&amp;nbsp;&lt;strong&gt;Sp&lt;/strong&gt;&amp;nbsp;is the&amp;nbsp;&lt;strong&gt;pooled estimate of the common standard deviation&lt;/strong&gt;&amp;nbsp;(assuming that the variances in the populations are similar) computed as the weighted average of the standard deviations in the samples.&lt;/p&gt;

&lt;p&gt;&lt;img alt=&quot;&quot; src=&quot;http://sphweb.bumc.bu.edu/otlt/MPH-Modules/BS/BS704_Confidence_Intervals/ada-reference.gif&quot;&gt;&lt;img alt=&quot;&quot; src=&quot;http://sphweb.bumc.bu.edu/otlt/MPH-Modules/BS/BS704_Confidence_Intervals/lessonimages/equation_image133.gif&quot;&gt;&lt;/p&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/546/computing-the-confidence-interval-difference-between-means?show=547#a547</guid>
<pubDate>Fri, 07 Dec 2018 23:04:51 +0000</pubDate>
</item>
<item>
<title>Answered: How do you know when to use T-Distribution instead of regular Confidence Interval?</title>
<link>https://ask.ghassem.com/537/know-when-distribution-instead-regular-confidence-interval?show=538#a538</link>
<description>What do you mean by &amp;quot;regular confidence interval&amp;quot;? If your question is when to use z-table and when to use t-table, it depends on the metrics we have. If we have population metrics, that we call them parameters, we should use z-table. If we are estimating population metrics such as mean and standard deviation using the sample, then we should use t-table.</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/537/know-when-distribution-instead-regular-confidence-interval?show=538#a538</guid>
<pubDate>Thu, 06 Dec 2018 00:52:34 +0000</pubDate>
</item>
<item>
<title>Answered: What is the T Distribution?</title>
<link>https://ask.ghassem.com/534/what-is-the-t-distribution?show=535#a535</link>
<description>&lt;p&gt;The&amp;nbsp;&lt;strong&gt;T distribution&amp;nbsp;&lt;/strong&gt;is a family of distributions that look almost identical to the&amp;nbsp;&lt;a rel=&quot;nofollow&quot; href=&quot;https://www.statisticshowto.datasciencecentral.com/probability-and-statistics/normal-distributions/&quot;&gt;normal distribution&amp;nbsp;&lt;/a&gt;curve, only a bit shorter and fatter. The t distribution is used instead of the normal distribution when you have small samples (for more on this, see:&amp;nbsp;&lt;a rel=&quot;nofollow&quot; href=&quot;https://www.statisticshowto.datasciencecentral.com/probability-and-statistics/hypothesis-testing/t-score-vs-z-score/&quot; target=&quot;_blank&quot;&gt;t-score vs. z-score&lt;/a&gt;). The larger the&amp;nbsp;&lt;a rel=&quot;nofollow&quot; href=&quot;https://www.statisticshowto.datasciencecentral.com/probability-and-statistics/find-sample-size/&quot; target=&quot;_blank&quot;&gt;sample size&lt;/a&gt;, the more the t distribution looks like the normal distribution. In fact, for sample sizes larger than 20 (e.g. more degrees of freedom), the distribution is almost exactly like the normal distribution.&lt;/p&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/534/what-is-the-t-distribution?show=535#a535</guid>
<pubDate>Fri, 30 Nov 2018 21:22:29 +0000</pubDate>
</item>
<item>
<title>Answered: How to calculate normal distribution?</title>
<link>https://ask.ghassem.com/532/how-to-calculate-normal-distribution?show=533#a533</link>
<description>&lt;p&gt;The standardized value of a normally distributed random variable is called a&amp;nbsp;&lt;em&gt;Z&lt;/em&gt;&amp;nbsp;score and is calculated using the following formula.&lt;/p&gt;

&lt;p&gt;&lt;img alt=&quot;&quot; src=&quot;http://ci.columbia.edu/ci/premba_test/c0331/images/s6/5836240103.gif&quot;&gt;&lt;/p&gt;

&lt;p&gt;&lt;em&gt;x&lt;/em&gt;&amp;nbsp;= the value that is being standardized&lt;br&gt;
&lt;em&gt;m&lt;/em&gt;&amp;nbsp;= the mean of the distribution&lt;br&gt;
&lt;em&gt;s&lt;/em&gt;&amp;nbsp;= standard deviation of the distribution&lt;/p&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/532/how-to-calculate-normal-distribution?show=533#a533</guid>
<pubDate>Fri, 30 Nov 2018 21:20:23 +0000</pubDate>
</item>
<item>
<title>Answered: Probability for the data items within mean +- 1, +-2, +-3 standard deviation?</title>
<link>https://ask.ghassem.com/529/probability-for-data-items-within-mean-standard-deviation?show=531#a531</link>
<description>Probobality mean for +-1 standard deviation is 0.68.&lt;br /&gt;
Probobality mean &amp;nbsp;for +-2 standard deviation is 0.95.&lt;br /&gt;
Probobality mean for +-3 standard deviation is 0.997.</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/529/probability-for-data-items-within-mean-standard-deviation?show=531#a531</guid>
<pubDate>Fri, 30 Nov 2018 21:18:08 +0000</pubDate>
</item>
<item>
<title>Answered: What will be probability for the data items within mean +- 1, +-2, +-3 standard deviation?</title>
<link>https://ask.ghassem.com/520/what-will-probability-data-items-within-standard-deviation?show=530#a530</link>
<description>Probobality mean for +-1 standard deviation is 0.68.&lt;br /&gt;
Probobality mean &amp;nbsp;for +-2 standard deviation is 0.95.&lt;br /&gt;
Probobality mean for +-3 standard deviation is 0.997.</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/520/what-will-probability-data-items-within-standard-deviation?show=530#a530</guid>
<pubDate>Fri, 30 Nov 2018 21:17:20 +0000</pubDate>
</item>
<item>
<title>Answered: How do I calculate a and z a/2?</title>
<link>https://ask.ghassem.com/525/how-do-i-calculate-a-and-z-a-2?show=526#a526</link>
<description>&lt;p&gt;Given a confidence interval of 95% :&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;$\alpha = 1 - 0.95 = 0.05$&lt;/li&gt;
&lt;li&gt;$z_{\alpha/2} = z_{0.05&amp;nbsp;/ 2} = z_{0.025} = 1.96$&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Given a confidence interval of 99%:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;a = 1 - 0.99&amp;nbsp;= &lt;strong&gt;0.01&lt;/strong&gt;&lt;/li&gt;
&lt;li&gt;z a/2 = 0.01&amp;nbsp;/ 2 = z 0.005&amp;nbsp;= &lt;strong&gt;2.576&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/525/how-do-i-calculate-a-and-z-a-2?show=526#a526</guid>
<pubDate>Thu, 29 Nov 2018 18:26:08 +0000</pubDate>
</item>
<item>
<title>Answered: What is the rule that determines percentages with the mean +- 1,2 or 3 standard deviations?</title>
<link>https://ask.ghassem.com/523/what-rule-that-determines-percentages-standard-deviations?show=524#a524</link>
<description>The Empirical Rule, which is used for normal distribution. The notes about the rule can be found in the slides.</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/523/what-rule-that-determines-percentages-standard-deviations?show=524#a524</guid>
<pubDate>Sat, 17 Nov 2018 19:04:26 +0000</pubDate>
</item>
<item>
<title>Answered: How much percentage of data items are consumed within mean +-1 , +-2, +-3 standard deviation?</title>
<link>https://ask.ghassem.com/518/much-percentage-items-consumed-within-standard-deviation?show=519#a519</link>
<description>For mean +-1 standard deviation, 68% of data items.&lt;br /&gt;
&lt;br /&gt;
For mean +-2 standard deviation, 95% of data items.&lt;br /&gt;
&lt;br /&gt;
For mean +-3 standard deviation, 99.7% of data items.</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/518/much-percentage-items-consumed-within-standard-deviation?show=519#a519</guid>
<pubDate>Wed, 14 Nov 2018 15:05:54 +0000</pubDate>
</item>
<item>
<title>What is a test statistic in hypothesis testing and how does it relate to the p-value?</title>
<link>https://ask.ghassem.com/516/what-test-statistic-hypothesis-testing-and-does-relate-value</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/516/what-test-statistic-hypothesis-testing-and-does-relate-value</guid>
<pubDate>Tue, 13 Nov 2018 00:51:11 +0000</pubDate>
</item>
<item>
<title>Answered: What is sampling error?</title>
<link>https://ask.ghassem.com/514/what-is-sampling-error?show=515#a515</link>
<description>Sampling error occurs when using a sample mean to estimate a population mean. &amp;nbsp;Usually the sample mean is quite close to the population mean, but it is important to understand that there will be some level of sampling error.</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/514/what-is-sampling-error?show=515#a515</guid>
<pubDate>Tue, 13 Nov 2018 00:48:25 +0000</pubDate>
</item>
<item>
<title>What is P-value in Statistical Testing?</title>
<link>https://ask.ghassem.com/510/what-is-p-value-in-statistical-testing</link>
<description>What is P-value in Statistical testing?</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/510/what-is-p-value-in-statistical-testing</guid>
<pubDate>Mon, 12 Nov 2018 13:41:39 +0000</pubDate>
</item>
<item>
<title>Answered: What is a sampling distribution of sample means?</title>
<link>https://ask.ghassem.com/508/what-is-a-sampling-distribution-of-sample-means?show=509#a509</link>
<description>A sampling distribution of sample means is a frequency distribution of all the possible means of samples of a given size n.</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/508/what-is-a-sampling-distribution-of-sample-means?show=509#a509</guid>
<pubDate>Mon, 12 Nov 2018 13:38:44 +0000</pubDate>
</item>
<item>
<title>Answered: In the first week of November 2003.... What is the expected number of homicides per week in the GTA?</title>
<link>https://ask.ghassem.com/503/first-week-november-2003-what-expected-number-homicides-week?show=504#a504</link>
<description>If there are 78 per year then the expected value or mean for a week is 78 homicides/52.1429 weeks which is 1.496 per week.</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/503/first-week-november-2003-what-expected-number-homicides-week?show=504#a504</guid>
<pubDate>Fri, 02 Nov 2018 19:36:25 +0000</pubDate>
</item>
<item>
<title>Answered: In statistics, whats the difference between contingency and frequency tables?</title>
<link>https://ask.ghassem.com/483/statistics-difference-between-contingency-frequency-tables?show=487#a487</link>
<description>Contingency table associates two categorical variables whereas frequency table associates one categorical variable</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/483/statistics-difference-between-contingency-frequency-tables?show=487#a487</guid>
<pubDate>Tue, 30 Oct 2018 11:38:10 +0000</pubDate>
</item>
<item>
<title>Answered: How to discover outliers of a data frame?</title>
<link>https://ask.ghassem.com/455/how-to-discover-outliers-of-a-data-frame?show=456#a456</link>
<description>&lt;p&gt;There are two types of analysis we will follow to find the outliers- Uni-variate(one variable outlier analysis) and Multi-variate(two or more variable outlier analysis).&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Discover outliers with visualization tools&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;Box plot&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;if there is an outlier it will plotted as point in boxplot but other population will be grouped together and display as boxes.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&lt;img alt=&quot;&quot; src=&quot;https://cdn-images-1.medium.com/max/1200/1*HWhJYbFlvR83JXJgaQlQWA.png&quot;&gt;&lt;/p&gt;

&lt;p&gt;Above plot shows three points between 10 to 12, these are outliers as there are not included in the box of other observation i.e nowhere near the quartiles.&lt;/p&gt;

&lt;p&gt;Here we analyzed Uni-variate outlier i.e. we used DIS column only to check the outlier.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;multivariate outlier analysis&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;Scatter plot&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The scatter plot is the collection of points that show values for two variables.&lt;/p&gt;

&lt;p&gt;&lt;img alt=&quot;&quot; height=&quot;218&quot; src=&quot;https://cdn-images-1.medium.com/max/1200/1*L-IHSSCQsTxXVUXmui5O6A.png&quot; width=&quot;428&quot;&gt;&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;Looking at the plot above, we can most of data points are lying bottom left side but there are points which are far from the population like the top right corner.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Discover outliers with a mathematical function&lt;/strong&gt;&lt;/p&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;Z-Score&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;The intuition behind Z-score is to describe any data point by finding their relationship with the Standard Deviation and Mean of the group of data points. Z-score is finding the distribution of data where mean is 0 and the standard deviation is 1 i.e. normal distribution.&lt;/p&gt;

&lt;p&gt;While calculating the Z-score we re-scale and center the data and look for data points which are too far from zero. These data points which are way too far from zero will be treated as the outliers. In most of the cases a threshold of 3 or -3 is used i.e if the Z-score value is greater than or less than 3 or -3 respectively, that data point will be identified as outliers.&lt;/p&gt;

&lt;p&gt;We will use Z-score function defined in scipy library to detect the outliers.&lt;/p&gt;

&lt;pre id=&quot;9ec5&quot;&gt;
from scipy import stats
import numpy as np&lt;/pre&gt;

&lt;pre id=&quot;8c4b&quot;&gt;
z = np.abs(stats.zscore(boston_df))
print(z)&lt;/pre&gt;

&lt;pre id=&quot;8f4e&quot;&gt;
threshold = 3
print(np.where(z &amp;gt; 3))&lt;/pre&gt;

&lt;ul&gt;
	&lt;li&gt;&lt;strong&gt;IQR score&amp;nbsp;&lt;/strong&gt;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;Box plot uses the IQR method to display data and outliers(shape of the data) but in order to get a list of the identified outlier, we will need to use the mathematical formula and retrieve the outlier data.&lt;/p&gt;

&lt;p&gt;IQR is somewhat similar to Z-score in terms of finding the distribution of data and then keeping some threshold to identify the outlier.&lt;/p&gt;

&lt;pre id=&quot;a95e&quot;&gt;
Q1 = boston_df_o1.quantile(0.25)
Q3 = boston_df_o1.quantile(0.75)
IQR = Q3 - Q1
print(IQR)&lt;/pre&gt;

&lt;p&gt;As we now have the IQR scores, it’s time to get hold on outliers. The below code will give an output with some true and false values. The data point where we have False that means these values are valid whereas True indicates presence of an outlier.&lt;/p&gt;

&lt;pre id=&quot;628f&quot;&gt;
print(boston_df_o1 &amp;lt; (Q1 - 1.5 * IQR)) |(boston_df_o1 &amp;gt; (Q3 + 1.5 * IQR))&lt;/pre&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/455/how-to-discover-outliers-of-a-data-frame?show=456#a456</guid>
<pubDate>Sun, 28 Oct 2018 11:21:10 +0000</pubDate>
</item>
<item>
<title>Answered: Define measures of center (Median and Mode) ?</title>
<link>https://ask.ghassem.com/357/define-measures-of-center-median-and-mode?show=448#a448</link>
<description>&lt;p&gt;Median&lt;/p&gt;

&lt;p&gt;The median is the middle score for a set of data that has been arranged in order of magnitude. The median is less affected by outliers and skewed data. In order to calculate the median, suppose we have the data below:&lt;/p&gt;

&lt;table border=&quot;1&quot; cellpadding=&quot;0&quot; style=&quot;width:500px; border-spacing: 0px;&quot;&gt;
	&lt;tbody&gt;
		&lt;tr&gt;
			&lt;td&gt;65&lt;/td&gt;
			&lt;td&gt;55&lt;/td&gt;
			&lt;td&gt;89&lt;/td&gt;
			&lt;td&gt;56&lt;/td&gt;
			&lt;td&gt;35&lt;/td&gt;
			&lt;td&gt;14&lt;/td&gt;
			&lt;td&gt;56&lt;/td&gt;
			&lt;td&gt;55&lt;/td&gt;
			&lt;td&gt;87&lt;/td&gt;
			&lt;td&gt;45&lt;/td&gt;
			&lt;td&gt;92&lt;/td&gt;
		&lt;/tr&gt;
	&lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;We first need to rearrange that data into order of magnitude (smallest first):&lt;/p&gt;

&lt;table border=&quot;1&quot; cellpadding=&quot;0&quot; style=&quot;width:500px; border-spacing: 0px;&quot;&gt;
	&lt;tbody&gt;
		&lt;tr&gt;
			&lt;td&gt;14&lt;/td&gt;
			&lt;td&gt;35&lt;/td&gt;
			&lt;td&gt;45&lt;/td&gt;
			&lt;td&gt;55&lt;/td&gt;
			&lt;td&gt;55&lt;/td&gt;
			&lt;td&gt;&lt;strong&gt;56&lt;/strong&gt;&lt;/td&gt;
			&lt;td&gt;56&lt;/td&gt;
			&lt;td&gt;65&lt;/td&gt;
			&lt;td&gt;87&lt;/td&gt;
			&lt;td&gt;89&lt;/td&gt;
			&lt;td&gt;92&lt;/td&gt;
		&lt;/tr&gt;
	&lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;Our median mark is the middle mark - in this case, 56 (highlighted in bold). It is the middle mark because there are 5 scores before it and 5 scores after it. This works fine when you have an odd number of scores, but what happens when you have an even number of scores? What if you had only 10 scores? Well, you simply have to take the middle two scores and average the result. So, if we look at the example below:&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;table border=&quot;1&quot; cellpadding=&quot;0&quot; style=&quot;width:500px; border-spacing: 0px;&quot;&gt;
	&lt;tbody&gt;
		&lt;tr&gt;
			&lt;td&gt;65&lt;/td&gt;
			&lt;td&gt;55&lt;/td&gt;
			&lt;td&gt;89&lt;/td&gt;
			&lt;td&gt;56&lt;/td&gt;
			&lt;td&gt;35&lt;/td&gt;
			&lt;td&gt;14&lt;/td&gt;
			&lt;td&gt;56&lt;/td&gt;
			&lt;td&gt;55&lt;/td&gt;
			&lt;td&gt;87&lt;/td&gt;
			&lt;td&gt;45&lt;/td&gt;
		&lt;/tr&gt;
	&lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;We again rearrange that data into order of magnitude (smallest first):&lt;/p&gt;

&lt;table border=&quot;1&quot; cellpadding=&quot;0&quot; style=&quot;width:500px; border-spacing: 0px;&quot;&gt;
	&lt;tbody&gt;
		&lt;tr&gt;
			&lt;td&gt;14&lt;/td&gt;
			&lt;td&gt;35&lt;/td&gt;
			&lt;td&gt;45&lt;/td&gt;
			&lt;td&gt;55&lt;/td&gt;
			&lt;td&gt;&lt;strong&gt;55&lt;/strong&gt;&lt;/td&gt;
			&lt;td&gt;&lt;strong&gt;56&lt;/strong&gt;&lt;/td&gt;
			&lt;td&gt;56&lt;/td&gt;
			&lt;td&gt;65&lt;/td&gt;
			&lt;td&gt;87&lt;/td&gt;
			&lt;td&gt;
			&lt;p&gt;89&lt;/p&gt;
			&lt;/td&gt;
		&lt;/tr&gt;
	&lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;Only now we have to take the 5th and 6th score in our data set and average them to get a median of 55.5.&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;Mode:&lt;/p&gt;

&lt;p&gt;The mode is the most frequent score in our data set. On a histogram it represents the highest bar in a bar chart or histogram. You can, therefore, sometimes consider the mode as being the most popular option. An example of a mode is presented below:&lt;/p&gt;

&lt;p&gt;&lt;img alt=&quot;&quot; height=&quot;432&quot; src=&quot;https://statistics.laerd.com/statistical-guides/img/mode-1.png&quot; width=&quot;436&quot;&gt;&lt;/p&gt;

&lt;p&gt;Normally, the mode is used for categorical data where we wish to know which is the most common category, as illustrated below:&lt;/p&gt;

&lt;p&gt;&lt;img alt=&quot;&quot; height=&quot;342&quot; src=&quot;https://statistics.laerd.com/statistical-guides/img/mode-1a.png&quot; width=&quot;430&quot;&gt;&lt;/p&gt;

&lt;p&gt;We can see above that the most common form of transport, in this particular data set, is the bus. However, one of the problems with the mode is that it is not unique, so it leaves us with problems when we have two or more values that share the highest frequency, such as below:&lt;/p&gt;

&lt;p&gt;&lt;img alt=&quot;&quot; height=&quot;431&quot; src=&quot;https://statistics.laerd.com/statistical-guides/img/mode-2.png&quot; width=&quot;436&quot;&gt;&lt;/p&gt;</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/357/define-measures-of-center-median-and-mode?show=448#a448</guid>
<pubDate>Sat, 27 Oct 2018 14:06:42 +0000</pubDate>
</item>
<item>
<title>Answer reshown: How many unique ways are there to arrange the letters in the word PRIOR?</title>
<link>https://ask.ghassem.com/328/how-many-unique-ways-are-there-arrange-the-letters-word-prior?show=334#a334</link>
<description>The word PRIOR has 5 letters and the letter R repeats twice. So the formula to calculate the answer is:&lt;br /&gt;
&lt;br /&gt;
5!/2! = 5*4*3*2*1 / 2*1 = 120/2 = 60.&lt;br /&gt;
&lt;br /&gt;
The answer is 60.</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/328/how-many-unique-ways-are-there-arrange-the-letters-word-prior?show=334#a334</guid>
<pubDate>Thu, 18 Oct 2018 11:47:51 +0000</pubDate>
</item>
<item>
<title>Retagged: Calculate IQR(Inter Quartile Range) of {18, 24, 19, 16, 21}?</title>
<link>https://ask.ghassem.com/351/calculate-iqr-inter-quartile-range-of-18-24-19-16-21?show=351#q351</link>
<description></description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/351/calculate-iqr-inter-quartile-range-of-18-24-19-16-21?show=351#q351</guid>
<pubDate>Tue, 16 Oct 2018 11:59:59 +0000</pubDate>
</item>
<item>
<title>Answered: How do you tell a permutation problem from a combination problem?</title>
<link>https://ask.ghassem.com/396/how-you-tell-permutation-problem-from-combination-problem?show=397#a397</link>
<description>
&lt;p&gt;The key is finding out if the&lt;strong&gt; order of items&lt;/strong&gt; are important or not. If the order does matter, it will be permutation, otherwise it is combination.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;In these two examples, we should think about context of the words &quot;&lt;strong&gt;Groups&lt;/strong&gt;&quot; and &quot;&lt;strong&gt;Lists&lt;/strong&gt;&quot;.&lt;strong&gt; For Groups, the order is NOT important, but because Lists comes with the&amp;nbsp; index, and the order does matter. &lt;/strong&gt;So, the first question is combination and the second one is permutation.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Combination&lt;/strong&gt;:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;1. How many different &lt;strong&gt;groups &lt;/strong&gt;of 4 students can be made from a class of 40?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;strong&gt;Answer:&lt;/strong&gt;&amp;nbsp; $C(40,4) = \binom{40}{4} = \frac{40!}{4!(40-4)!} $&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Permutation:&amp;nbsp;&lt;/strong&gt;&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;2. How many different &lt;strong&gt;lists&lt;/strong&gt; of 4 students can be made from a class of 40?&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&lt;strong&gt;Answer:&lt;/strong&gt;&amp;nbsp; $P(40,4) &amp;nbsp;= \frac{40!}{(40-4)!} $&lt;/p&gt;
</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/396/how-you-tell-permutation-problem-from-combination-problem?show=397#a397</guid>
<pubDate>Mon, 15 Oct 2018 17:15:22 +0000</pubDate>
</item>
<item>
<title>Reshown: What is the probability of getting a King of Hearts?</title>
<link>https://ask.ghassem.com/359/what-is-the-probability-of-getting-a-king-of-hearts?show=359#q359</link>
<description>What is the probability of picking a King of Hearts in a standard 52 card deck?</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/359/what-is-the-probability-of-getting-a-king-of-hearts?show=359#q359</guid>
<pubDate>Mon, 15 Oct 2018 02:53:32 +0000</pubDate>
</item>
<item>
<title>Answer selected: What is the use of the Poisson Distribution?</title>
<link>https://ask.ghassem.com/376/what-is-the-use-of-the-poisson-distribution?show=377#a377</link>
<description>Generally speaking, the Poisson distribution can be used for probability problems where only the expected number of occurrences is known. Specifically, it gives us the probability of a given number of events happening in a fixed interval of time. &lt;br /&gt;
&lt;br /&gt;
For example, one would use the Poisson distribution for problems like: counts per unit time, defects per unit area, events per unit length, etc .</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/376/what-is-the-use-of-the-poisson-distribution?show=377#a377</guid>
<pubDate>Sun, 14 Oct 2018 13:03:36 +0000</pubDate>
</item>
<item>
<title>Answered: If x is the number that comes up when you roll a 20 sided die. What is the expected value of x?</title>
<link>https://ask.ghassem.com/360/the-number-that-comes-when-you-roll-sided-what-expected-value?show=380#a380</link>
<description>&lt;p&gt;Assign a number to each of those events (having 1 as outcome, having 2 as outcome, ..., having 20 as outcome) which is the job of &lt;strong&gt;Random Variable&lt;/strong&gt;. For this specific example, we can assign the same numbers. The&amp;nbsp;&lt;strong&gt;Expected Value&lt;/strong&gt; equation is $\mu = \sum x.p(x)$. Now you have $x = {1,2,...,20}$, and if&amp;nbsp;you have $p(x)$, you can easily calculate expected value.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;In this case that we have a fair die with 20 sides, probability of each side is $\frac{1}{20}$. Now, we can calculate it in this way:&lt;/p&gt;

&lt;p&gt;$\mu = \sum x.p(x) = \sum_{x=1}^{20}&amp;nbsp;x.\frac{1}{20}&amp;nbsp;&amp;nbsp;= \frac{1}{20} \sum_{x=1}^{20}&amp;nbsp;x =&amp;nbsp;\frac{1}{20} \frac{20\times21}{2} = 10.5&amp;nbsp;&amp;nbsp;$&lt;/p&gt;

&lt;p&gt;(note: $\sum_{x=1}^{n}&amp;nbsp;x = \frac {n\times(n+1)}{2}$)&lt;/p&gt;</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/360/the-number-that-comes-when-you-roll-sided-what-expected-value?show=380#a380</guid>
<pubDate>Sun, 14 Oct 2018 04:22:54 +0000</pubDate>
</item>
<item>
<title>Answered: How do we determine if two events are independent</title>
<link>https://ask.ghassem.com/374/how-do-we-determine-if-two-events-are-independent?show=375#a375</link>
<description>&lt;p&gt;We can call two events&amp;nbsp;&lt;strong&gt;independent&lt;/strong&gt;&amp;nbsp;if the outcome of one of the events doesn&#039;t impact the outcome of another event.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;For instance,&amp;nbsp;we could&amp;nbsp;throw 2 dice and consider the probability that both are 6&#039;s. So, we have thrown our dice. The outcome of the first die throw does not impact the probability of the second die throw. Regardless of what the first outcome was, the second die still has a 1/6 chance of rolling a 6.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/374/how-do-we-determine-if-two-events-are-independent?show=375#a375</guid>
<pubDate>Sat, 13 Oct 2018 15:32:02 +0000</pubDate>
</item>
<item>
<title>Answered: In the context of probability, what is the &quot;event space&quot;</title>
<link>https://ask.ghassem.com/372/in-the-context-of-probability-what-is-the-event-space?show=373#a373</link>
<description>&lt;p&gt;The event space&lt;strong&gt;&amp;nbsp;is a subset of outcomes&lt;/strong&gt; from the sample space. The sample space refers to all possible outcomes of an event.&amp;nbsp;&lt;/p&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/372/in-the-context-of-probability-what-is-the-event-space?show=373#a373</guid>
<pubDate>Sat, 13 Oct 2018 15:10:46 +0000</pubDate>
</item>
<item>
<title>Answered: What is the &quot;Three-Standard-Deviations Rule&quot;</title>
<link>https://ask.ghassem.com/370/what-is-the-three-standard-deviations-rule?show=371#a371</link>
<description>&lt;p&gt;This rule is used to remember the percentage of values that lie around the mean in a normal distribution.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;It is a helpful rule to quickly analyze a normal distribution.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;To reiterate&lt;/strong&gt;,&amp;nbsp;68% of the data is within 1 standard deviation, 95% is within 2 standard deviations, 99.7% is within 3 standard deviations&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/370/what-is-the-three-standard-deviations-rule?show=371#a371</guid>
<pubDate>Sat, 13 Oct 2018 15:04:07 +0000</pubDate>
</item>
<item>
<title>Answered: How do histograms help us understand a data set?</title>
<link>https://ask.ghassem.com/367/how-do-histograms-help-us-understand-a-data-set?show=368#a368</link>
<description>&lt;p&gt;Histograms are very effective for modeling &lt;strong&gt;continuous&lt;/strong&gt; data. When we plot histograms, we have more insight into the underlying distribution. This allows us to identify key traits like outliers and skewness.&lt;/p&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/367/how-do-histograms-help-us-understand-a-data-set?show=368#a368</guid>
<pubDate>Sat, 13 Oct 2018 14:46:18 +0000</pubDate>
</item>
<item>
<title>Answered: What is a discrete random variable?</title>
<link>https://ask.ghassem.com/365/what-is-a-discrete-random-variable?show=366#a366</link>
<description>&lt;p&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://revisionmaths.com/advanced-level-maths-revision/statistics/discrete-random-variables&quot;&gt;https://revisionmaths.com/advanced-level-maths-revision/statistics/discrete-random-variables&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;A &lt;strong&gt;discrete variable&lt;/strong&gt; is a variable that can only take a &lt;em&gt;finite&lt;/em&gt; number of values. The variable is said to be &lt;strong&gt;random&lt;/strong&gt; if the &lt;em&gt;sum&lt;/em&gt; of the probabilities is 1.&lt;/p&gt;</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/365/what-is-a-discrete-random-variable?show=366#a366</guid>
<pubDate>Sat, 13 Oct 2018 01:23:29 +0000</pubDate>
</item>
<item>
<title>Answered: What is the difference between permutation and combination?</title>
<link>https://ask.ghassem.com/363/what-is-the-difference-between-permutation-and-combination?show=364#a364</link>
<description>&lt;p&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://towardsdatascience.com/difference-between-permutation-and-combination-9e12b6763ee1&quot;&gt;https://towardsdatascience.com/difference-between-permutation-and-combination-9e12b6763ee1&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Permutation&lt;/strong&gt; is when the order matters on the selection of objects, whereas &lt;strong&gt;Combination&lt;/strong&gt; is the number of combinations that can be performed on a set of &lt;em&gt;n &lt;/em&gt;objects&lt;/p&gt;</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/363/what-is-the-difference-between-permutation-and-combination?show=364#a364</guid>
<pubDate>Sat, 13 Oct 2018 01:20:38 +0000</pubDate>
</item>
<item>
<title>Answered: What is the difference between Qualitative and Quantitative data?</title>
<link>https://ask.ghassem.com/361/what-the-difference-between-qualitative-quantitative-data?show=362#a362</link>
<description>Qualitative data is not a number, and is typically a text value, whereas Quantitative data is a number (which is then divided into sub-groups: Discrete and Continuous).</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/361/what-the-difference-between-qualitative-quantitative-data?show=362#a362</guid>
<pubDate>Sat, 13 Oct 2018 01:16:53 +0000</pubDate>
</item>
<item>
<title>Answered: What is binomial experiment and when do we use it?</title>
<link>https://ask.ghassem.com/355/what-is-binomial-experiment-and-when-do-we-use-it?show=358#a358</link>
<description>A binomial experiment is a probability experiment where a same process is repeated for certain number of times. &lt;br /&gt;
&lt;br /&gt;
So we use binomial experiment when we know that the events are independent of each others and has only 2 possible outcomes. For example, tossing a coin has 2 outcomes head or tail. Success is denoted as p while failure is denoted as 1- p .</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/355/what-is-binomial-experiment-and-when-do-we-use-it?show=358#a358</guid>
<pubDate>Fri, 12 Oct 2018 22:30:50 +0000</pubDate>
</item>
<item>
<title>What is &quot;Random Sampling&quot;?</title>
<link>https://ask.ghassem.com/356/what-is-random-sampling</link>
<description>Define random sampling</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/356/what-is-random-sampling</guid>
<pubDate>Fri, 12 Oct 2018 22:26:06 +0000</pubDate>
</item>
<item>
<title>Answered: When should we use permutation and combination?</title>
<link>https://ask.ghassem.com/353/when-should-we-use-permutation-and-combination?show=354#a354</link>
<description>Permutations are used when we need to arrange things in a specific order. For example, lets assume there are 10 people and we have to assign them medals. So order is important here as 1st person gets gold while 2nd gets silver. In this case, we use permutations.&lt;br /&gt;
&lt;br /&gt;
Combinations are used when we need to make groups where order doesn&amp;#039;t matter. For example, if we need to give 3 tin cans to 8 people, order doesn&amp;#039;t matter here for the way we pick people. In this case, we use combinations.</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/353/when-should-we-use-permutation-and-combination?show=354#a354</guid>
<pubDate>Fri, 12 Oct 2018 22:22:57 +0000</pubDate>
</item>
<item>
<title>Answered: What do population, parameter, census and sample mean in statistics?</title>
<link>https://ask.ghassem.com/349/what-population-parameter-census-and-sample-mean-statistics?show=350#a350</link>
<description>Population is a set of similar items or events. It can be a group of any existing objects.&lt;br /&gt;
&lt;br /&gt;
Parameter is an very important component of population. It is a numerical quantity which features population or some aspect in it.&lt;br /&gt;
&lt;br /&gt;
Census is a survey which is conducted after a complete observation belonging to any population.&lt;br /&gt;
&lt;br /&gt;
Sample generally refers to a set of observations. Here, it refers to a set of observations drawn from a population.</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/349/what-population-parameter-census-and-sample-mean-statistics?show=350#a350</guid>
<pubDate>Fri, 12 Oct 2018 21:57:50 +0000</pubDate>
</item>
<item>
<title>Answered: Question for Discrete random variable</title>
<link>https://ask.ghassem.com/347/question-for-discrete-random-variable?show=348#a348</link>
<description>μ, of a random variable x is $\mu = \sum x.P(x)$ &amp;nbsp;for all values of x&lt;br /&gt;
&lt;br /&gt;
Here we will subtract the second value as it is in loss.&lt;br /&gt;
&lt;br /&gt;
$\mu = (0.60)(20000) - (0.40)(25000)$ &amp;nbsp;&lt;br /&gt;
&lt;br /&gt;
$\mu = 2000$</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/347/question-for-discrete-random-variable?show=348#a348</guid>
<pubDate>Fri, 12 Oct 2018 21:07:12 +0000</pubDate>
</item>
<item>
<title>Answered: Sample question of permutation and combination.</title>
<link>https://ask.ghassem.com/345/sample-question-of-permutation-and-combination?show=346#a346</link>
<description>Here the order of the r elements does not matter, so we will use combination formula:&lt;br /&gt;
&lt;br /&gt;
= $\frac{n!}{r!\left(n-r\right)!}$&lt;br /&gt;
&lt;br /&gt;
= $\frac{5!}{2!\left(5-2\right)!}$&lt;br /&gt;
&lt;br /&gt;
= 10</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/345/sample-question-of-permutation-and-combination?show=346#a346</guid>
<pubDate>Fri, 12 Oct 2018 20:52:42 +0000</pubDate>
</item>
<item>
<title>Answered: {1.75,  1.63,  1.55,  1.92,  1.81, 1.79, 1.81}. Determine the mean, the median, and the 20th percentile?</title>
<link>https://ask.ghassem.com/343/75-63-55-determine-the-mean-the-median-and-the-20th-percentile?show=344#a344</link>
<description>&lt;p&gt;Mean = (1.75 + 1.63 + 1.55 + 1.92 + 1.81 + 1.79 + 1.81) / 7 = 1.751&lt;/p&gt;

&lt;p&gt;Median = Arrange the values in ascending order, if the total number of values in data set is odd the middle value is median&lt;/p&gt;

&lt;p&gt;1.55, 1.63, 1.75, &lt;span class=&quot;marker&quot;&gt;1.79&lt;/span&gt;, 1.81, 1.81, 1.92&lt;/p&gt;

&lt;p&gt;Percentile =&amp;nbsp;1.55, 1.63, 1.75, 1.79, 1.81, 1.81, 1.92&lt;/p&gt;

&lt;p&gt;A = (nk)/100&amp;nbsp;&lt;/p&gt;

&lt;p&gt;= (7 x 20)/100 = 1.4&lt;/p&gt;

&lt;p&gt;=&amp;gt; 2nd value in data set, which is 1.63&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/343/75-63-55-determine-the-mean-the-median-and-the-20th-percentile?show=344#a344</guid>
<pubDate>Fri, 12 Oct 2018 20:37:02 +0000</pubDate>
</item>
<item>
<title>Answered: What is  difference between discrete numerical variable and continuous numerical variable?</title>
<link>https://ask.ghassem.com/341/difference-discrete-numerical-variable-continuous-numerical?show=342#a342</link>
<description>A discrete numerical variable can be determined by counting a quantity or variables that can only take on a finite number of values are called &amp;quot;discrete variables.&amp;quot; &lt;br /&gt;
&lt;br /&gt;
Variables that can take on an infinite number of possible values are called continuous numerical variable.</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/341/difference-discrete-numerical-variable-continuous-numerical?show=342#a342</guid>
<pubDate>Fri, 12 Oct 2018 19:28:25 +0000</pubDate>
</item>
<item>
<title>Answered: What is the difference between Descriptive Statistics and Inferential Statistics?</title>
<link>https://ask.ghassem.com/326/difference-descriptive-statistics-inferential-statistics?show=335#a335</link>
<description>Answer:&lt;br /&gt;
&lt;br /&gt;
Descriptive Statistics: Collecting, summarizing, and presenting sample data using numerical and graphical methods.&lt;br /&gt;
&lt;br /&gt;
Inferential Statistics: Making estimates, decisions, predictions, or other generalizations about a larger set of data based on sampling.</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/326/difference-descriptive-statistics-inferential-statistics?show=335#a335</guid>
<pubDate>Fri, 12 Oct 2018 02:09:38 +0000</pubDate>
</item>
<item>
<title>Answered: Find the expected value of X from the probability table.</title>
<link>https://ask.ghassem.com/329/find-the-expected-value-of-x-from-the-probability-table?show=333#a333</link>
<description>The expected value of X, E(X) = ∑X*P(X).&lt;br /&gt;
&lt;br /&gt;
E(X) = 1 * 0.3 + 2 * 0.8 + 3 × 0.4&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;= 0.3 + 1.6 + 1.2 = 3.1&lt;br /&gt;
&lt;br /&gt;
Answer: 3.1</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/329/find-the-expected-value-of-x-from-the-probability-table?show=333#a333</guid>
<pubDate>Fri, 12 Oct 2018 02:08:21 +0000</pubDate>
</item>
<item>
<title>Answered: If a die is rolled, find the probability of rolling an even number</title>
<link>https://ask.ghassem.com/327/if-die-is-rolled-find-the-probability-of-rolling-even-number?show=332#a332</link>
<description>First write the S of the scenario. A die has 6 possible numbers.&lt;br /&gt;
S = {1,2,3,4,5,6} &lt;br /&gt;
E in this scenario is all possible even numbers.&lt;br /&gt;
E = {2,4,6} &lt;br /&gt;
Then write the classical probability. &lt;br /&gt;
P(E) = n(E) / n(S) = 3 / 6 = 1 / 2</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/327/if-die-is-rolled-find-the-probability-of-rolling-even-number?show=332#a332</guid>
<pubDate>Fri, 12 Oct 2018 02:07:45 +0000</pubDate>
</item>
</channel>
</rss>