<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
<channel>
<title>Ask Ghassem - Recent activity in Data Science</title>
<link>https://ask.ghassem.com/activity/data-science</link>
<description>Powered by Question2Answer</description>
<item>
<title>Answered: How to analyse an imbalanced categorical column in a dataset</title>
<link>https://ask.ghassem.com/1042/how-to-analyse-imbalanced-categorical-colum-in-dataset?show=1051#a1051</link>
<description>&lt;p&gt;For imbalanced categorical data, you shouldn&#039;t drop the column. Instead, you can try techniques like oversampling the minority class or using models that handle imbalanced data well, like XGBoost. This way, you can still extract useful insights without losing important information.&lt;/p&gt;</description>
<category>Data Science</category>
<guid isPermaLink="true">https://ask.ghassem.com/1042/how-to-analyse-imbalanced-categorical-colum-in-dataset?show=1051#a1051</guid>
<pubDate>Thu, 19 Feb 2026 18:40:15 +0000</pubDate>
</item>
<item>
<title>Can you verify the validity of this chart comparing the review scores for Marvel Phase 4?</title>
<link>https://ask.ghassem.com/1030/verify-validity-chart-comparing-review-scores-marvel-phase</link>
<description>&lt;p&gt;I have some skepticism about the validity of the charts below comparing the critic and audience reviews for Phase 4 of the MCU to the previous 3 phases. There are over 18 movies and tv shows in Phase 4 compared to the 6 movies in Phases 1 &amp;amp; 2 and the 11 movies in Phase 3. Also, there are far fewer critic reviews for the Phase 4 tv shows than the Phase 4 movies. For example, on Rotten Tomatoes there are only 40 critic reviews for The Falcon and the Winter Soldier and 452 critic reviews for Black Widow. Could this uneven and inconsistent number of reviews between tv shows and movies in Phase 4 be inaccurately making the overall averages higher than they should be? Or do you agree with the conclusions presented in the charts?&lt;/p&gt;

&lt;p&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://cdn.discordapp.com/attachments/997145183172964435/1059948060194652230/image.png&quot;&gt;https://cdn.discordapp.com/attachments/997145183172964435/1059948060194652230/image.png&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://cdn.discordapp.com/attachments/997145183172964435/1049356020469739520/image.png&quot;&gt;https://cdn.discordapp.com/attachments/997145183172964435/1049356020469739520/image.png&lt;/a&gt;&lt;/p&gt;</description>
<category>Exploratory Data Analysis</category>
<guid isPermaLink="true">https://ask.ghassem.com/1030/verify-validity-chart-comparing-review-scores-marvel-phase</guid>
<pubDate>Mon, 09 Jan 2023 16:29:14 +0000</pubDate>
</item>
<item>
<title>How do I compare the count of a value in each year while having a different sample size each year?</title>
<link>https://ask.ghassem.com/1025/compare-count-value-each-year-while-having-different-sanple</link>
<description>How do I accurately compare a survey measure across years when both the survey engagement rate and the number of employees vary from year to year?&lt;br /&gt;
&lt;br /&gt;
Suppose I measure employee satisfaction by surveying my employees each year, asking them whether they are satisfied or not, and then compare the number of yes answers over the years. The number of employees who answer is not the same each year, and the number of employees increases every year. How do I compare the results correctly from year to year?&lt;br /&gt;
&lt;br /&gt;
In other words, how do I remove the effect of the survey engagement rate when calculating the results?</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/1025/compare-count-value-each-year-while-having-different-sanple</guid>
<pubDate>Wed, 08 Jun 2022 10:32:33 +0000</pubDate>
</item>
<item>
<title>Is it possible to make a forecast of a future value of Air Temperature using Fast Fourier Transform?</title>
<link>https://ask.ghassem.com/1024/possible-forecast-future-value-temperature-fourier-transform</link>
<description>Is it possible to forecast a future value of air temperature using the Fast Fourier Transform? If so, what would the process be, and how would I go about it? Thank you!</description>
<category>Data Science</category>
<guid isPermaLink="true">https://ask.ghassem.com/1024/possible-forecast-future-value-temperature-fourier-transform</guid>
<pubDate>Thu, 02 Jun 2022 16:10:26 +0000</pubDate>
</item>
<item>
<title>forecast log transformed fitted values for 2 years using ARMA model</title>
<link>https://ask.ghassem.com/1023/forecast-transformed-fitted-values-years-using-arma-model</link>
<description>Input is a stock price in exponential transformation. We are asked to forecast using ARMA results for 2 years.</description>
<category>Exploratory Data Analysis</category>
<guid isPermaLink="true">https://ask.ghassem.com/1023/forecast-transformed-fitted-values-years-using-arma-model</guid>
<pubDate>Wed, 04 May 2022 20:31:44 +0000</pubDate>
</item>
<item>
<title>Recategorized: Battery data projects</title>
<link>https://ask.ghassem.com/1017/battery-data-projects?show=1017#q1017</link>
<description>Where can I find projects related to battery data?</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/1017/battery-data-projects?show=1017#q1017</guid>
<pubDate>Thu, 03 Mar 2022 09:42:53 +0000</pubDate>
</item>
<item>
<title>How can you build a dynamic pricing model with data only from rigid pricing?</title>
<link>https://ask.ghassem.com/1016/build-dynamic-pricing-model-with-data-only-from-rigid-pricing</link>
<description>I want to build a dynamic pricing model: if a product is too expensive for a client and there is a risk that we might lose them, we lower the price for them, but if a client doesn&amp;#039;t care that much about the price, we might increase the price a little.&lt;br /&gt;
&lt;br /&gt;
All the articles I&amp;#039;ve seen describe some kind of A/B testing for the pricing and then create a model.&lt;br /&gt;
&lt;br /&gt;
I want to build a model using only the existing rigid pricing data. I have the prices offered to customers, and I know who bought the product and who went to another company.&lt;br /&gt;
&lt;br /&gt;
How can I do the increasing price part?</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/1016/build-dynamic-pricing-model-with-data-only-from-rigid-pricing</guid>
<pubDate>Fri, 21 Jan 2022 06:44:31 +0000</pubDate>
</item>
<item>
<title>What analytical software would be good for a company to use?</title>
<link>https://ask.ghassem.com/1015/what-analytical-software-would-be-good-for-a-company-to-use</link>
<description>This would be for a company that is just now looking into using software to track data for wine making.</description>
<category>Data Science</category>
<guid isPermaLink="true">https://ask.ghassem.com/1015/what-analytical-software-would-be-good-for-a-company-to-use</guid>
<pubDate>Fri, 14 Jan 2022 16:46:38 +0000</pubDate>
</item>
<item>
<title>How do I know which encoder to use to convert from categorical variables to numerical?</title>
<link>https://ask.ghassem.com/1006/know-which-encoder-convert-categorical-variables-numerical</link>
<description>So say I have a column with categorical data like different styles of temperature: &amp;#039;Lukewarm&amp;#039;, &amp;#039;Hot&amp;#039;, &amp;#039;Scalding&amp;#039;, &amp;#039;Cold&amp;#039;, &amp;#039;Frostbite&amp;#039;,... etc.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
I know that we can use pd.get_dummies to convert the column to numerical data within the dataframe, but I also know that there are other &amp;#039;converters&amp;#039; (not sure if that&amp;#039;s the correct terminology) that we can use, e.g. OneHotEncoder from scikit-learn (I could use the pipeline module to make a nice pipeline and feed my dataframe through it to also get my categorical data encoded as numerical).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
How do I know which to use? Does it matter? If it does matter, when does it matter the most (i.e. what types of problems? When there are lots of categorical variables, or few?) If anyone can give me any pointers on this type of stuff I&amp;#039;d greatly appreciate it.</description>
<category>Exploratory Data Analysis</category>
<guid isPermaLink="true">https://ask.ghassem.com/1006/know-which-encoder-convert-categorical-variables-numerical</guid>
<pubDate>Mon, 29 Nov 2021 04:09:06 +0000</pubDate>
</item>
<item>
<title>ValueError: Length mismatch: Expected axis has 60 elements, new values have 2935849 elements</title>
<link>https://ask.ghassem.com/1005/valueerror-length-mismatch-expected-elements-2935849-elements</link>
<description>&lt;p&gt;I&#039;m creating a new data frame&amp;nbsp;with the most used items grouped together. But I got the following error when grouping through ID and items.&amp;nbsp;ValueError: Length mismatch: Expected axis has 60 elements, new values have 2935849 elements.&lt;/p&gt;

&lt;pre class=&quot;prettyprint lang-python&quot; data-pbcklang=&quot;python&quot; data-pbcktabsize=&quot;4&quot;&gt;
df = sales_df[sales_df[&#039;shop_id&#039;].duplicated(keep=False)]
df[&#039;Grouped&#039;] = sales_df.groupby(&#039;shop_id&#039;)[&#039;item_name&#039;].transform(lambda x: &#039;,&#039;.join(x))
df2 = df[[&#039;shop_id&#039;, &#039;Grouped&#039;]].drop_duplicates()&lt;/pre&gt;

&lt;p&gt;In the aforementioned code, I&#039;m making a data frame with respect to shop id and then grouping through shop items. My objective here is to group items with similar ID.&lt;/p&gt;</description>
<category>Exploratory Data Analysis</category>
<guid isPermaLink="true">https://ask.ghassem.com/1005/valueerror-length-mismatch-expected-elements-2935849-elements</guid>
<pubDate>Fri, 26 Nov 2021 06:09:16 +0000</pubDate>
</item>
<item>
<title>Text Mining, Artificial Neural Networks, Speech Processing, Cloud Computing in DS? Essential for a good Data Scientist?</title>
<link>https://ask.ghassem.com/1004/artificial-networks-processing-computing-essential-scientist</link>
<description></description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/1004/artificial-networks-processing-computing-essential-scientist</guid>
<pubDate>Wed, 27 Oct 2021 19:15:16 +0000</pubDate>
</item>
<item>
<title>Classification of data object might be incorrect</title>
<link>https://ask.ghassem.com/1003/classification-of-data-object-might-be-incorrect</link>
<description>&lt;p&gt;I am learning a new Salesforce product (Evergage) for the company I work for. In the program&#039;s documentation they have listed a set of data objects as an example. It appears to me that the classification might be incorrect. Their system makes a division between &#039;catalog objects&#039; and &#039;profile objects&#039; and the example they have given is a banking institution. They classified &lt;em&gt;Customer Credit Card&lt;/em&gt; as a &lt;em&gt;profile object&lt;/em&gt; and &lt;em&gt;Credit Card Level&lt;/em&gt; as a &lt;em&gt;catalog object&lt;/em&gt;. It seems to me that it should be the other way around, i.e. &lt;em&gt;Customer Credit Card&lt;/em&gt; = &lt;em&gt;catalog object&lt;/em&gt; and &lt;em&gt;Credit Card Level&lt;/em&gt; = &lt;em&gt;profile object&lt;/em&gt;. Maybe I am not reading the context correctly?&lt;/p&gt;

&lt;p&gt;Here is a link to an image with the complete classification: &lt;a rel=&quot;nofollow&quot; href=&quot;https://drive.google.com/file/d/1nG4aX4Ty_NoHxm04AQo1Ow61m3MZ3pXm/view?usp=sharing&quot;&gt;https://drive.google.com/file/d/1nG4aX4Ty_NoHxm04AQo1Ow61m3MZ3pXm/view?usp=sharing&lt;/a&gt;&lt;/p&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/1003/classification-of-data-object-might-be-incorrect</guid>
<pubDate>Mon, 25 Oct 2021 15:26:46 +0000</pubDate>
</item>
<item>
<title>Can Data Science solve this problem?</title>
<link>https://ask.ghassem.com/1002/can-data-science-solve-this-problem</link>
<description>So, I live in Brazil, and I have a task for college where I don&amp;#039;t know what data science method to use, if any, to solve it. My idea is the following: we Brazilians have the Real (BRL) as our currency, and of course we have the dollar exchange rate to see &amp;quot;how many Reais a dollar is worth&amp;quot;. I want to research whether the country&amp;#039;s news has any influence on this price. For example, if Bolsonaro, our president, says some dumb stuff, the dollar goes up in price, and vice versa. I want to collect all dollar values and their variance over a set time interval, and use web scraping to get the news from some economy sites. Here&amp;#039;s my question then: how can I correlate the news with the dollar variance over a set time? Can data science do that? How do I preprocess this, if at all? Do I need to use bag-of-words? At least that&amp;#039;s what I heard... Please help, and thank you for reading.</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/1002/can-data-science-solve-this-problem</guid>
<pubDate>Sun, 24 Oct 2021 15:43:11 +0000</pubDate>
</item>
<item>
<title>Should I start as a data analyst and then move into data science?</title>
<link>https://ask.ghassem.com/994/should-i-start-as-a-data-analyst-then-data-science</link>
<description>Should I start as a data analyst and then move into data science?&lt;br /&gt;
&lt;br /&gt;
I am a second-year Bachelor&amp;#039;s student in Computer Science and want to become a Data Scientist.&lt;br /&gt;
&lt;br /&gt;
However, when I try to apply for internships/jobs, most of them require a Master&amp;#039;s/Ph.D.&lt;br /&gt;
&lt;br /&gt;
Data Analyst roles, on the other hand, have fewer requirements.&lt;br /&gt;
&lt;br /&gt;
Do you recommend starting off as a Data Analyst and then switching to Data Science?</description>
<category>Data Science</category>
<guid isPermaLink="true">https://ask.ghassem.com/994/should-i-start-as-a-data-analyst-then-data-science</guid>
<pubDate>Mon, 21 Jun 2021 20:31:04 +0000</pubDate>
</item>
<item>
<title>Answered: How best to ensure data quality?</title>
<link>https://ask.ghassem.com/990/how-best-to-ensure-data-quality?show=992#a992</link>
<description>This is really a broad question. The best quality usually comes from a good quality source of data generation.</description>
<category>Data Science</category>
<guid isPermaLink="true">https://ask.ghassem.com/990/how-best-to-ensure-data-quality?show=992#a992</guid>
<pubDate>Fri, 11 Jun 2021 18:09:27 +0000</pubDate>
</item>
<item>
<title>Searching for movie dataset containing movie synopses/plots?</title>
<link>https://ask.ghassem.com/988/searching-for-movie-dataset-containing-movie-synopses-plots</link>
<description>Hello&lt;br /&gt;
To build a hybrid recommendation system, I used the MovieLens 1M dataset for the collaborative filtering part. Now I&amp;#039;m looking for a database/dataset that contains descriptions/summaries/details/synopses/plots of movies for the content-based recommendation part.&lt;br /&gt;
Could someone tell me where I can find such a dataset?&lt;br /&gt;
Thank you in advance.</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/988/searching-for-movie-dataset-containing-movie-synopses-plots</guid>
<pubDate>Thu, 27 May 2021 09:57:31 +0000</pubDate>
</item>
<item>
<title>Answered: How to calculate average with deviating sensors?</title>
<link>https://ask.ghassem.com/983/how-to-calculate-average-with-deviating-sensors?show=985#a985</link>
<description>What seems to work is simple: create 4 new columns:&lt;br /&gt;
x = average(3 values); y = stdev.p(3 values); low threshold = x - y; high threshold = x + y&lt;br /&gt;
Then copy each value if it lies within the thresholds, and enter &amp;#039;n/a&amp;#039; if it lies outside them.&lt;br /&gt;
With only the in-range values copied, the average can be calculated ignoring the extreme values.&lt;br /&gt;
Example:&lt;br /&gt;
10; 11; 20: avg = 13.67; stdev.p = 4.50; low = 9.17; high = 18.16, so&lt;br /&gt;
10; 11; n/a &amp;nbsp;(as 20 &amp;gt; 18.16)&lt;br /&gt;
This gives an average of 10.5 &amp;nbsp;&amp;nbsp;:-)&lt;br /&gt;
&lt;br /&gt;
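A minimal Python sketch of the same filter, assuming each row holds three sensor readings. The function name robust_average is made up for illustration, and statistics.pstdev plays the role of stdev.p:

```python
from statistics import mean, pstdev


def robust_average(readings):
    """Average only the readings within one population std dev of the mean."""
    center = mean(readings)
    spread = pstdev(readings)  # population standard deviation, like stdev.p
    low, high = center - spread, center + spread
    kept = [r for r in readings if high >= r >= low]
    # Fall back to the plain mean if rounding leaves nothing inside the band.
    return mean(kept) if kept else center


print(robust_average([10, 11, 20]))  # 10.5
```

With only three readings a single outlier also inflates the standard deviation, so the one-sigma band is a rough rule of thumb rather than a general outlier test.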
Agree?</description>
<category>Data Science</category>
<guid isPermaLink="true">https://ask.ghassem.com/983/how-to-calculate-average-with-deviating-sensors?show=985#a985</guid>
<pubDate>Wed, 05 May 2021 11:49:02 +0000</pubDate>
</item>
<item>
<title>Answer selected: Terminology clarification in Spark</title>
<link>https://ask.ghassem.com/979/terminology-clarification-in-spark?show=981#a981</link>
<description>The engine is the same regardless of which interface language you use. For some tasks, such as specialized data cleaning, there is probably no SQL command, and we have to use Scala or Python. In Zeppelin you can switch back and forth among the languages the engine supports; however, this is not common practice. For some specific tasks you can use pure Spark SQL, and if you want to use SQL from PySpark or Scala, there are functions that can help you do so.&lt;br /&gt;
&lt;br /&gt;
I believe that looking at more examples will help you understand when to use which.</description>
<category>Big Data Tools</category>
<guid isPermaLink="true">https://ask.ghassem.com/979/terminology-clarification-in-spark?show=981#a981</guid>
<pubDate>Wed, 17 Feb 2021 16:04:05 +0000</pubDate>
</item>
<item>
<title>My GloVe word embeddings contain sentiment?</title>
<link>https://ask.ghassem.com/972/my-glove-word-embeddings-contain-sentiment</link>
<description>&lt;p&gt;I&#039;ve been researching sentiment analysis with word embeddings. I read papers that state that word embeddings ignore sentiment information of the words in the text. One paper states that among the top 10 words that are semantically similar, around 30 percent of words have opposite polarity e.g. happy - sad.&lt;/p&gt;

&lt;p&gt;So, I computed word embeddings on my dataset (Amazon reviews) with the GloVe algorithm in R. Then, I looked at the most similar words with cosine similarity and I found that actually every word is sentimentally similar. (E.g. beautiful - lovely - gorgeous - pretty - nice - love). Therefore, I was wondering how this is possible since I expected the opposite from reading several papers. What could be the reason for my findings?&lt;/p&gt;

&lt;p&gt;Two of the many papers I read:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Yu, L. C., Wang, J., Lai, K. R. &amp;amp; Zhang, X. (2017). Refining Word Embeddings Using Intensity Scores for Sentiment Analysis. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 26(3), 671-681.&lt;/li&gt;
&lt;li&gt;Tang, D., Wei, F., Yang, N., Zhou, M., Liu, T. &amp;amp; Qin, B. (2014). Learning Sentiment-Specific Word Embedding for Twitter Sentiment Classification. Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics, 1: Long Papers, 1555-1565&lt;/li&gt;
&lt;/ul&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/972/my-glove-word-embeddings-contain-sentiment</guid>
<pubDate>Sun, 03 Jan 2021 14:09:37 +0000</pubDate>
</item>
<item>
<title>is it possible to derive a new 95% CI from two separate 95% CIs?</title>
<link>https://ask.ghassem.com/961/is-it-possible-to-derive-a-new-95-ci-from-two-separate-95-cis</link>
<description></description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/961/is-it-possible-to-derive-a-new-95-ci-from-two-separate-95-cis</guid>
<pubDate>Mon, 23 Nov 2020 14:45:19 +0000</pubDate>
</item>
<item>
<title>Probability of a bus arriving at its destination based on weather conditions</title>
<link>https://ask.ghassem.com/953/probability-arrived-destination-based-weather-condition</link>
<description>A bus is making its way to a destination. If the weather conditions are favorable today, the likelihood of delay is 3%. If the weather conditions are not favorable today, the likelihood of delay is 50%. The forecast predicts that it is 20% likely that the weather conditions will be favorable today.&lt;br /&gt;
&lt;br /&gt;
1. What is the likelihood that the bus will be delayed?&lt;br /&gt;
&lt;br /&gt;
2. The bus has arrived, but it was delayed. Given that the bus was delayed, what is the likelihood that the weather conditions were favorable?</description>
<category>Discrete Mathematics</category>
<guid isPermaLink="true">https://ask.ghassem.com/953/probability-arrived-destination-based-weather-condition</guid>
<pubDate>Mon, 09 Nov 2020 13:06:47 +0000</pubDate>
</item>
<item>
<title>Answered: How to remove unwanted Jupyter notebook kernels?</title>
<link>https://ask.ghassem.com/947/how-to-remove-unwanted-jupyter-notebook-kernels?show=948#a948</link>
<description>&lt;p&gt;You can list the installed kernels with:&lt;/p&gt;

&lt;pre class=&quot;prettyprint lang-python&quot; data-pbcklang=&quot;python&quot; data-pbcktabsize=&quot;4&quot;&gt;
jupyter kernelspec list
&lt;/pre&gt;

&lt;p&gt;Then remove the ones you no longer need with the following command, replacing &lt;em&gt;kernel&lt;/em&gt; with the name of the kernel you want to remove:&lt;/p&gt;

&lt;pre class=&quot;prettyprint lang-python&quot; data-pbcklang=&quot;python&quot; data-pbcktabsize=&quot;&quot;&gt;
jupyter kernelspec remove &lt;em&gt;kernel&lt;/em&gt;&lt;/pre&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/947/how-to-remove-unwanted-jupyter-notebook-kernels?show=948#a948</guid>
<pubDate>Fri, 30 Oct 2020 17:18:25 +0000</pubDate>
</item>
<item>
<title>How can this data be structured for MongoDB?</title>
<link>https://ask.ghassem.com/889/how-can-this-data-be-structured-for-mongodb</link>
<description>&lt;p&gt;&lt;img alt=&quot;&quot; src=&quot;https://prnt.sc/tkr2g7&quot;&gt;&lt;/p&gt;

&lt;p&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://prnt.sc/tkr2g7&quot; target=&quot;_blank&quot;&gt;https://prnt.sc/tkr2g7&lt;/a&gt; Hello, I have a PFE about determining the risks to pedestrians, and I have to build a simulator to generate related data. This is my first time working on this. I would like to know how to structure the data. I will be working with MongoDB, so I would love to see an example in JSON.&lt;/p&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/889/how-can-this-data-be-structured-for-mongodb</guid>
<pubDate>Sun, 19 Jul 2020 19:08:50 +0000</pubDate>
</item>
<item>
<title>guidance on sequencing data science courses below</title>
<link>https://ask.ghassem.com/844/guidance-on-sequencing-data-science-courses-below</link>
<description>Hello,&lt;br /&gt;
my name is Lutaaya Mudathiru.&lt;br /&gt;
&lt;br /&gt;
I am planning to start the online data science professional courses at Harvard University, but I don&amp;#039;t know which course I should begin with. I would like help sequencing the courses below so that I can benefit more:&lt;br /&gt;
&lt;br /&gt;
1. Principles, Statistical and Computational Tools for Reproducible Science&lt;br /&gt;
&lt;br /&gt;
2. Data Science: Inference and Modeling&lt;br /&gt;
&lt;br /&gt;
3. Data Science: Productivity Tools&lt;br /&gt;
&lt;br /&gt;
4. Data Science: Wrangling&lt;br /&gt;
&lt;br /&gt;
5. Data Science: Linear Regression&lt;br /&gt;
&lt;br /&gt;
6. Data Science: Machine Learning&lt;br /&gt;
&lt;br /&gt;
7. Data Science: Capstone&lt;br /&gt;
&lt;br /&gt;
8. Data Science: R Basics&lt;br /&gt;
&lt;br /&gt;
9. Data Science: Visualization&lt;br /&gt;
&lt;br /&gt;
10. Data Science: Probability&lt;br /&gt;
&lt;br /&gt;
11. High-Dimensional Data Analysis&lt;br /&gt;
&lt;br /&gt;
12. Introduction to Linear Models and Matrix Algebra&lt;br /&gt;
&lt;br /&gt;
13. Data Science: Statistics and R&lt;br /&gt;
&lt;br /&gt;
14. Fat Chance: Probability from the Ground Up&lt;br /&gt;
&lt;br /&gt;
15. Introduction to Probability (on edX)</description>
<category>Data Science</category>
<guid isPermaLink="true">https://ask.ghassem.com/844/guidance-on-sequencing-data-science-courses-below</guid>
<pubDate>Fri, 20 Mar 2020 13:55:49 +0000</pubDate>
</item>
<item>
<title>Answered: What are the differences among Data Science, Artificial Intelligence and Machine Learning?</title>
<link>https://ask.ghassem.com/842/differences-science-artificial-intelligence-machine-learning?show=843#a843</link>
<description>&lt;p&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://www.datacamp.com/resources/whitepapers/ai-and-machine-learning&quot;&gt;This whitepaper&lt;/a&gt;&amp;nbsp;contains great insights on each. In a nutshell:&amp;nbsp;&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Artificial Intelligence (AI)&lt;/strong&gt; is “a huge set of tools for making computers behave intelligently” and in an automated fashion. This includes voice assistants, recommendation systems, and self-driving cars.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Machine Learning (ML)&lt;/strong&gt; is the “field of study that gives computers the ability to learn without being explicitly programmed.” The lion’s share of ML involves computers learning patterns from existing data and applying them to new data to make predictions, such as predicting whether an email is spam or not, whether a customer will churn or not, and diagnosing a particular piece of medical imaging.&lt;/p&gt;

&lt;p&gt;&lt;strong&gt;Data Science (DS)&lt;/strong&gt; is about making discoveries and creating insights from data and communicating these insights and discoveries to non-technical stakeholders.&lt;/p&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/842/differences-science-artificial-intelligence-machine-learning?show=843#a843</guid>
<pubDate>Thu, 05 Mar 2020 03:06:12 +0000</pubDate>
</item>
<item>
<title>Answered: How to convert Jupyter Notebook or a webpage to PDF using Chrome?</title>
<link>https://ask.ghassem.com/840/how-to-convert-jupyter-notebook-or-webpage-pdf-using-chrome?show=841#a841</link>
<description>&lt;p&gt;Please take a look at the short video below. Although it demonstrates the steps in Google Colab, they can be applied to other document types (any webpage!).&lt;/p&gt;

&lt;p&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://www.youtube.com/watch?v=pNbPQmrfVFY&quot;&gt;https://www.youtube.com/watch?v=pNbPQmrfVFY&lt;/a&gt;&lt;/p&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/840/how-to-convert-jupyter-notebook-or-webpage-pdf-using-chrome?show=841#a841</guid>
<pubDate>Thu, 27 Feb 2020 18:41:45 +0000</pubDate>
</item>
<item>
<title>Answered: How to share a Jupyter Notebook document on Google Colab?</title>
<link>https://ask.ghassem.com/836/how-to-share-a-jupyter-notebook-document-on-google-colab?show=837#a837</link>
<description>&lt;p&gt;Please watch the following short video which explains sharing Jupyter Notebooks on Google Colab:&lt;/p&gt;

&lt;p&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://www.youtube.com/watch?v=oJwe1bxA4jQ&quot;&gt;https://www.youtube.com/watch?v=oJwe1bxA4jQ&lt;/a&gt;&lt;/p&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/836/how-to-share-a-jupyter-notebook-document-on-google-colab?show=837#a837</guid>
<pubDate>Thu, 27 Feb 2020 16:19:54 +0000</pubDate>
</item>
<item>
<title>Answered: What are the most common data types in data science?</title>
<link>https://ask.ghassem.com/834/what-are-the-most-common-data-types-in-data-science?show=835#a835</link>
<description>&lt;p&gt;Object attributes can be modeled in two main forms: &lt;strong&gt;Categorical&lt;/strong&gt; and &lt;strong&gt;Numerical&lt;/strong&gt;. A more detailed explanation is available &lt;a rel=&quot;nofollow&quot; href=&quot;http://survivestatistics.com/variables/&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;

&lt;p&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://i.imgur.com/JLbmW8m.jpg&quot;&gt;https://i.imgur.com/JLbmW8m.jpg&lt;/a&gt;&lt;/p&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/834/what-are-the-most-common-data-types-in-data-science?show=835#a835</guid>
<pubDate>Wed, 19 Feb 2020 17:33:08 +0000</pubDate>
</item>
<item>
<title>Commented: How can I prove that following is a tautology (using laws of logical equivalences)</title>
<link>https://ask.ghassem.com/818/prove-that-following-tautology-using-logical-equivalences?show=831#c831</link>
<description>If you can record a video, and explain step by step, that would be much better. It could be confusing how you conclude line 6 from line 5, &amp;nbsp;but if you just record a video and talk over it while solving, that will be clarified. In addition, I removed your personal information such as student number from your post. Please do not include them while posting publicly.</description>
<category>Discrete Mathematics</category>
<guid isPermaLink="true">https://ask.ghassem.com/818/prove-that-following-tautology-using-logical-equivalences?show=831#c831</guid>
<pubDate>Tue, 18 Feb 2020 19:50:20 +0000</pubDate>
</item>
<item>
<title>Answered: can someone send me an online link to discrete mathematics 8th edition textbook?</title>
<link>https://ask.ghassem.com/813/someone-send-online-discrete-mathematics-edition-textbook?show=816#a816</link>
<description>&lt;p&gt;I think &lt;a rel=&quot;nofollow&quot; href=&quot;http://93.174.95.29/_ads/638D5EC2547DEF019458AB88AD39A040&quot;&gt;this link&lt;/a&gt;&amp;nbsp;is what you want. However, it is better to purchase the book.&lt;/p&gt;</description>
<category>Discrete Mathematics</category>
<guid isPermaLink="true">https://ask.ghassem.com/813/someone-send-online-discrete-mathematics-edition-textbook?show=816#a816</guid>
<pubDate>Thu, 13 Feb 2020 15:27:35 +0000</pubDate>
</item>
<item>
<title>Answered: How to install Matplotlib</title>
<link>https://ask.ghassem.com/792/how-to-install-matplotlib?show=804#a804</link>
<description>The easiest way is to use Anaconda, which comes with Matplotlib pre-installed. You can also use Google Colab.</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/792/how-to-install-matplotlib?show=804#a804</guid>
<pubDate>Sat, 08 Feb 2020 06:46:05 +0000</pubDate>
</item>
<item>
<title>Understanding symbolic language of problem, quantificational logic</title>
<link>https://ask.ghassem.com/786/understanding-symbolic-language-problem-quantificational</link>
<description>&lt;p&gt;Hi, I am having trouble interpreting the information contained in the relation &lt;strong&gt;R&lt;/strong&gt;, and how it should be applied to the Ps in this problem:&lt;/p&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;blockquote&gt;
&lt;p&gt;Consider the formula&amp;nbsp;&lt;/p&gt;

&lt;h3&gt;∃x∃y∃z(P(x,y)∧P(z,y)∧P(x,z)∧¬P(z,x))&lt;/h3&gt;

&lt;p&gt;Under each of these interpretations, is this formula true? In each case, R is the relation corresponding to P.&lt;/p&gt;

&lt;p&gt;(a) U = N,&amp;nbsp; &amp;nbsp;&lt;strong&gt;R = {&amp;lt;x,y&amp;gt; : x&amp;lt;y}.&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;(b) U = N,&amp;nbsp; &amp;nbsp;&lt;strong&gt;R = {&amp;lt;x,x+1&amp;gt; : x≥0}.&lt;/strong&gt;&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;&amp;nbsp;&lt;/p&gt;
&lt;p&gt;Does &amp;lt;x,y&amp;gt; refer to whichever pair of variables (x, y or z) appears in each P(a,b), and does the condition : x&amp;lt;y state the relation that must hold between the two?&lt;/p&gt;

&lt;p&gt;I tried something like this for (a) and got:&lt;/p&gt;

&lt;p&gt;∃x∃y∃z((x&amp;lt;y)∧(z&amp;lt;y)∧(x&amp;lt;z)∧¬(z&amp;lt;x))&lt;/p&gt;

&lt;p&gt;However, I&#039;m not sure if this is correct, and I&#039;m not sure how I would do it for (b).&lt;/p&gt;</description>
<category>Discrete Mathematics</category>
<guid isPermaLink="true">https://ask.ghassem.com/786/understanding-symbolic-language-problem-quantificational</guid>
<pubDate>Sun, 26 Jan 2020 11:18:05 +0000</pubDate>
</item>
<item>
<title>Answer selected: What are the values stored in tc8 format for the following numbers?</title>
<link>https://ask.ghassem.com/782/what-are-the-values-stored-tc8-format-for-following-numbers?show=785#a785</link>
<description>Q1-ANS)&lt;br /&gt;
for -117d:&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;$-117d = -(01110101)b$&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;$-(01110101)b = 10001010(oc8) + 1 = 10001011(tc8)$&lt;br /&gt;
&lt;br /&gt;
for -127d:&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;$-127d=-(01111111)b$&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;$-(01111111)b = 10000000(oc8) + 1 = 10000001(tc8)$&lt;br /&gt;
&lt;br /&gt;
for 127d:&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;$127d = 01111111b = 01111111(tc8)$&lt;br /&gt;
&lt;br /&gt;
for 0d:&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;$0d = 00000000b$&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;$Two&amp;#039;s complement &amp;nbsp;= one&amp;#039;s complement + 1 = 11111111(oc8) +1 = 00000000(tc8)$&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;The carry out of the most significant bit (the 9th digit) is discarded, so the result is still 8 bits.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Q2-ANS)&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;1.&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;$10010001(tc8) = -(01101110+1)b = -(01101111)b = -((0 × 2^7) + (1 × 2^6) + (1 × 2^5) + (0 × 2^4) + (1 × 2^3) + (1 × 2^2) + (1 × 2^1) + (1 × 2^0))d &amp;nbsp;= -111d$&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;2.&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;$00010001(tc8) = ((0 × 2^7) + (0 × 2^6) + (0 × 2^5) + (1 × 2^4) + (0 × 2^3) + (0 × 2^2) + (0 × 2^1) + (1 × 2^0))d = 17d$&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;3.&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;$01111111(tc8) = ((0 × 2^7) + (1 × 2^6) + (1 × 2^5) + (1 × 2^4) + (1 × 2^3) + (1 × 2^2) + (1 × 2^1) + (1 × 2^0))d = 127d$&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;4.&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;$00000000(tc8) = ((0 × 2^7) + (0 × 2^6) + (0 × 2^5) + (0 × 2^4) + (0 × 2^3) + (0 × 2^2) + (0 × 2^1) + (0 × 2^0))d = 0d$&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;5.&lt;br /&gt;
&lt;br /&gt;
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;$11111111(tc8) = -(00000000 + 1)b = -(00000001)b = -1d$&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
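The conversions above can be checked with a short Python sketch. The helper names to_tc8 and from_tc8 are hypothetical (they are not part of the original answer), and the sketch assumes the 8-bit range -128 to 127.&lt;br /&gt;
&lt;br /&gt;

```python
def to_tc8(value):
    """Return the tc8 (8-bit) bit string of an integer. Hypothetical helper."""
    if value not in range(-128, 128):
        raise ValueError("value does not fit in 8 bits")
    # Reducing modulo 2^8 gives the same bits as inverting the magnitude and adding 1.
    return format(value % 256, "08b")


def from_tc8(bits):
    """Decode a tc8 (8-bit) bit string to an integer. Hypothetical helper."""
    if len(bits) != 8:
        raise ValueError("expected exactly 8 bits")
    unsigned = int(bits, 2)
    # A leading 1 marks a negative number, so subtract 2^8 in that case.
    return unsigned - 256 if bits[0] == "1" else unsigned


# Q1: decimal values to tc8 bit patterns.
for number in (-117, -127, 127, 0):
    print(number, to_tc8(number))

# Q2: tc8 bit patterns back to decimal values.
for pattern in ("10010001", "00010001", "01111111", "00000000", "11111111"):
    print(pattern, from_tc8(pattern))
```

&lt;br /&gt;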
Answer submitted by Amirhossein Azizafshari from Thursday 1 PM class.</description>
<category>Discrete Mathematics</category>
<guid isPermaLink="true">https://ask.ghassem.com/782/what-are-the-values-stored-tc8-format-for-following-numbers?show=785#a785</guid>
<pubDate>Fri, 24 Jan 2020 01:02:27 +0000</pubDate>
</item>
<item>
<title>Answer selected: How do i find the inverse of the function?</title>
<link>https://ask.ghassem.com/763/how-do-i-find-the-inverse-of-the-function?show=764#a764</link>
<description>Hey there bro. Let me teach you a trick to solving simple inverse functions. Let&amp;#039;s use your question as an example. Let&amp;#039;s switch the x and the y. This will give us $x = 4y + 5$. Now try and isolate the y again: subtract 5 from both sides, then divide by 4. You should end up with $y = (x-5)/4$. Have fun!</description>
<category>Discrete Mathematics</category>
<guid isPermaLink="true">https://ask.ghassem.com/763/how-do-i-find-the-inverse-of-the-function?show=764#a764</guid>
<pubDate>Fri, 06 Dec 2019 04:35:19 +0000</pubDate>
</item>
<item>
<title>Answer selected: How to calculate $g[h(3)]$ if $g(x)= 2x+3$ and $h(x) =4x+5$ ?</title>
<link>https://ask.ghassem.com/766/how-to-calculate-g-h-3-if-g-x-2x-3-and-h-x-4x-5?show=768#a768</link>
<description>You need to calculate $h(3)$ first. $h(3)= 4\times3+5=17$.&lt;br /&gt;
&lt;br /&gt;
Therefore, $g[h(3)]=g(17)= 2\times17+3=37$.</description>
<category>Discrete Mathematics</category>
<guid isPermaLink="true">https://ask.ghassem.com/766/how-to-calculate-g-h-3-if-g-x-2x-3-and-h-x-4x-5?show=768#a768</guid>
<pubDate>Fri, 06 Dec 2019 04:35:09 +0000</pubDate>
</item>
<item>
<title>Answered: Can anyone please solve Q7 pg 329 in Induction?</title>
<link>https://ask.ghassem.com/761/can-anyone-please-solve-q7-pg-329-in-induction?show=762#a762</link>
<description>Never mind, I was able to solve it myself. The trick was to rearrange, factor out $5^{k+1}$, and add the exponents to get $5^{k+2}$.</description>
<category>Discrete Mathematics</category>
<guid isPermaLink="true">https://ask.ghassem.com/761/can-anyone-please-solve-q7-pg-329-in-induction?show=762#a762</guid>
<pubDate>Thu, 05 Dec 2019 22:10:12 +0000</pubDate>
</item>
<item>
<title>Individual and group relative strength in a fixed pool of players: How to approach the problem?</title>
<link>https://ask.ghassem.com/751/individual-group-relative-strength-players-approach-problem</link>
<description>&lt;div&gt;I apologize in advance if my question sounds too basic to be worthy of anyone&#039;s time, but statistics are not part of my curriculum.&lt;/div&gt;

&lt;div&gt;
&lt;p&gt;I am developing a proof of concept of a web application modeling the contribution of individual soccer players with respect to the different teams they&#039;ve played with throughout their careers. In particular, I am looking for a way of &lt;em&gt;ranking&lt;/em&gt; both individuals and groups of players as follows:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;&lt;strong&gt;teammates relative strength&lt;/strong&gt;: the best/worst combinations of players when playing in the same team in the same matches;&lt;/li&gt;
&lt;li&gt;&lt;strong&gt;opponents relative strength&lt;/strong&gt;: the best/worst combinations of players when playing in opposite teams in the same matches, i.e. which tuples of teammates are the best/worst against which;&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;I must admit I don&#039;t quite know how to approach the problem (as I said I have no formal education in statistics or data science). I would be very grateful&amp;nbsp; if anyone could give me some directions. How should I frame this particular problem and what resources in statistics or machine learning (if indeed this is a task fit for machine learning, perhaps I am mistaken on this) would be appropriate to tackle it?&lt;/p&gt;

&lt;p&gt;I am eager to learn, so both practical examples or theoretical references (book chapters, online articles, etc) would be very welcome.&lt;/p&gt;

&lt;p&gt;Thanks in advance!&lt;/p&gt;
&lt;/div&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/751/individual-group-relative-strength-players-approach-problem</guid>
<pubDate>Tue, 29 Oct 2019 20:00:28 +0000</pubDate>
</item>
<item>
<title>Answered: ideas and opinion on what kind of analyses needs to be done</title>
<link>https://ask.ghassem.com/713/ideas-and-opinion-on-what-kind-of-analyses-needs-to-be-done?show=714#a714</link>
<description>&lt;p&gt;It is a broad question and could have many answers. You can ask many questions and try to answer them based on your main business problem: &quot;the impact of all the products that were discontinued last year on the customers and sales&quot;. For example:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;You can test the hypothesis that discontinuing a specific product was a good or a bad decision.
&lt;ul&gt;
&lt;li&gt;You can check whether sales of similar, not-discontinued&amp;nbsp;products in the store increased, to investigate whether customers who used to buy the discontinued product are still looking for it. You can state a null hypothesis&amp;nbsp;and an alternative hypothesis to analyze&amp;nbsp;the impact statistically.&lt;/li&gt;
&lt;li&gt;You can review customer calls/emails received after the discontinuation and analyze&amp;nbsp;the customers&#039; sentiment toward this decision. This could be done using machine learning.&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;

&lt;p&gt;You have to generate questions and try to answer them using the data you have. It depends entirely on how you define &lt;em&gt;impact&lt;/em&gt; in your statement. After generating the questions and finding answers, you can make recommendations based on&amp;nbsp;your results. For example, if you find that discontinuing the product was a bad decision, you can recommend reintroducing it.&lt;/p&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/713/ideas-and-opinion-on-what-kind-of-analyses-needs-to-be-done?show=714#a714</guid>
<pubDate>Thu, 01 Aug 2019 12:31:05 +0000</pubDate>
</item>
<item>
<title>Answered: What are the most important Python libraries for data science?</title>
<link>https://ask.ghassem.com/677/what-are-the-most-important-python-libraries-for-data-science?show=678#a678</link>
<description>&lt;p&gt;Based on the list given &lt;a rel=&quot;nofollow&quot; href=&quot;https://www.analyticsvidhya.com/blog/2019/07/dont-miss-out-24-amazing-python-libraries-data-science/&quot;&gt;here&lt;/a&gt;, the most important libraries are:&lt;/p&gt;

&lt;ul&gt;
&lt;li&gt;Python Libraries for Data Collection
&lt;ul&gt;
&lt;li&gt;Beautiful Soup&lt;/li&gt;
&lt;li&gt;Scrapy&lt;/li&gt;
&lt;li&gt;Selenium&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Python Libraries for Data Cleaning and Manipulation
&lt;ul&gt;
&lt;li&gt;Pandas&lt;/li&gt;
&lt;li&gt;PyOD&lt;/li&gt;
&lt;li&gt;NumPy&lt;/li&gt;
&lt;li&gt;Spacy&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Python Libraries for Data Visualization
&lt;ul&gt;
&lt;li&gt;Matplotlib&lt;/li&gt;
&lt;li&gt;Seaborn&lt;/li&gt;
&lt;li&gt;Bokeh&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Python Libraries for Modeling
&lt;ul&gt;
&lt;li&gt;Scikit-learn&lt;/li&gt;
&lt;li&gt;TensorFlow&lt;/li&gt;
&lt;li&gt;PyTorch&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Python Libraries for Model Interpretability
&lt;ul&gt;
&lt;li&gt;Lime&lt;/li&gt;
&lt;li&gt;H2O&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Python Libraries for Audio Processing
&lt;ul&gt;
&lt;li&gt;Librosa&lt;/li&gt;
&lt;li&gt;Madmom&lt;/li&gt;
&lt;li&gt;pyAudioAnalysis&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Python Libraries for Image Processing
&lt;ul&gt;
&lt;li&gt;OpenCV-Python&lt;/li&gt;
&lt;li&gt;Scikit-image&lt;/li&gt;
&lt;li&gt;Pillow&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Python Libraries for Database
&lt;ul&gt;
&lt;li&gt;Psycopg&lt;/li&gt;
&lt;li&gt;SQLAlchemy&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;li&gt;Python Libraries for Deployment
&lt;ul&gt;
&lt;li&gt;Flask&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ul&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/677/what-are-the-most-important-python-libraries-for-data-science?show=678#a678</guid>
<pubDate>Mon, 08 Jul 2019 04:43:54 +0000</pubDate>
</item>
<item>
<title>Using aggregate data to generate observation-level data statistically sound?</title>
<link>https://ask.ghassem.com/644/using-aggregate-generate-observation-level-statistically</link>
<description>&lt;p&gt;Context: I work in paid search marketing. Our current reporting does not provide event-level data, only aggregate totals broken down by different segments.&amp;nbsp; I want to compare distributions and test the statistical significance of A/B test results.&amp;nbsp; I did not want to assume that the data follows a normal distribution, and I do not know the standard deviation of the data, so I came up with this approach.&amp;nbsp;&lt;/p&gt;

&lt;p&gt;My question: I am going to take the average &quot;CPA&quot; or &quot;CTR&quot; for a date range and generate one observation per conversion, each set to the average for that date range.&amp;nbsp; Is this a statistically sound way to generate raw data? Would I get wonky distributions because of the repeated averages?&amp;nbsp; I just want a gut check on whether I&#039;m completely off base.&amp;nbsp;&amp;nbsp;&lt;/p&gt;

&lt;p&gt;My Aggregate data looks like below:&lt;/p&gt;

&lt;table border=&quot;1&quot; cellpadding=&quot;1&quot; style=&quot;width:500px&quot;&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th scope=&quot;col&quot;&gt;Day&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;Cost&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;Acquisition&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;CPA or CTR&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp;1&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 40&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp;2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 75&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; 3&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$25&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;Observation data I generate looks like below:&lt;/p&gt;

&lt;table border=&quot;1&quot; cellpadding=&quot;1&quot; style=&quot;width:500px&quot;&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th scope=&quot;col&quot;&gt;Day&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;Acquisition&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;1&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$20&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$25&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$25&amp;nbsp;&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;2&lt;/td&gt;
&lt;td&gt;&amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp; &amp;nbsp;$25&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;I really appreciate your help with this question! This is an important project for me at work.&amp;nbsp;&amp;nbsp;&lt;/p&gt;</description>
<category>general</category>
<guid isPermaLink="true">https://ask.ghassem.com/644/using-aggregate-generate-observation-level-statistically</guid>
<pubDate>Tue, 11 Jun 2019 22:04:01 +0000</pubDate>
</item>
<item>
<title>Answered: What are the available libraries for continuous time hidden markov models ?</title>
<link>https://ask.ghassem.com/640/what-available-libraries-continuous-hidden-markov-models?show=643#a643</link>
<description>&lt;p&gt;I found some Python implementations by searching GitHub. &lt;a rel=&quot;nofollow&quot; href=&quot;https://github.com/lopatovsky/HMMs&quot;&gt;https://github.com/lopatovsky/HMMs&lt;/a&gt;&amp;nbsp;is one of them.&lt;/p&gt;</description>
<category>Data Science</category>
<guid isPermaLink="true">https://ask.ghassem.com/640/what-available-libraries-continuous-hidden-markov-models?show=643#a643</guid>
<pubDate>Tue, 11 Jun 2019 21:19:58 +0000</pubDate>
</item>
<item>
<title>Answer selected: How to use LinkedIn Learning (formerly Lynda.com) for free?</title>
<link>https://ask.ghassem.com/24/how-to-use-linkedin-learning-formerly-lynda-com-for-free?show=74#a74</link>
<description>&lt;p&gt;If you have a university or college email address, the following screenshots show you how to sign in to &lt;strong&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;http://lynda.com&quot;&gt;Lynda.com&lt;/a&gt;&lt;/strong&gt; using that account.&lt;/p&gt;

&lt;ol&gt;
&lt;li&gt;Click on &lt;strong&gt;Sign&lt;/strong&gt; &lt;strong&gt;in&lt;/strong&gt; on the top menu to see the following page:&lt;br&gt;
&lt;br&gt;
&lt;img alt=&quot;&quot; height=&quot;481&quot; src=&quot;https://i.imgur.com/NmpOmuX.png&quot; width=&quot;600&quot;&gt;&lt;br&gt;
&amp;nbsp;&lt;/li&gt;
&lt;li&gt;Click on&amp;nbsp;&lt;strong&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://www.lynda.com/signin/lynda&quot;&gt;Sign in with Lynda&lt;/a&gt;&lt;/strong&gt;, and then &lt;strong&gt;enter your school or college email address&lt;br&gt;
&lt;br&gt;
&lt;img alt=&quot;&quot; height=&quot;428&quot; src=&quot;https://i.imgur.com/bDchKgb.png&quot; width=&quot;600&quot;&gt;&lt;/strong&gt;&lt;br&gt;
&amp;nbsp;&lt;/li&gt;
&lt;li&gt;It will redirect you to your organization&#039;s portal,&amp;nbsp;and after that you can create a free account on Lynda and use all of the content for free as long as you can sign in this way.&lt;/li&gt;
&lt;li&gt;Some city&amp;nbsp;libraries also offer&amp;nbsp;LinkedIn Learning:
&lt;ul&gt;
&lt;li&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://www.torontopubliclibrary.ca/detail.jsp?Entt=RDMEDB0187&amp;amp;R=EDB0187&quot;&gt;City of Toronto&lt;/a&gt;&lt;/li&gt;
&lt;li&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://www.rhpl.ca/node/1862&quot;&gt;City of Richmond Hill&lt;/a&gt;&lt;/li&gt;
&lt;/ul&gt;
&lt;/li&gt;
&lt;/ol&gt;


</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/24/how-to-use-linkedin-learning-formerly-lynda-com-for-free?show=74#a74</guid>
<pubDate>Wed, 22 May 2019 06:59:13 +0000</pubDate>
</item>
<item>
<title>Answered: How to reshape in pandas dataframe?</title>
<link>https://ask.ghassem.com/608/how-to-reshape-in-pandas-dataframe?show=609#a609</link>
<description>&lt;p&gt;Did you try using &lt;strong&gt;reshape(-1,1)&lt;/strong&gt;?&amp;nbsp;&lt;/p&gt;

&lt;pre class=&quot;prettyprint lang-python&quot; data-pbcklang=&quot;python&quot; data-pbcktabsize=&quot;4&quot;&gt;
import numpy as np

a = np.array([[1, 2, 3, 4],
              [5, 6, 7, 8],
              [9, 10, 11, 12]])
print(a.shape)
# output: (3, 4)

# -1 lets NumPy infer the number of rows; 1 forces a single column
print(a.reshape(-1, 1))
&#039;&#039;&#039;
output:
[[ 1]
 [ 2]
 [ 3]
 [ 4]
 [ 5]
 [ 6]
 [ 7]
 [ 8]
 [ 9]
 [10]
 [11]
 [12]]
&#039;&#039;&#039;&lt;/pre&gt;
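
&lt;p&gt;Since the question is about a pandas DataFrame, here is a minimal sketch of the same idea applied to a DataFrame column. The DataFrame df and its column names are hypothetical, and the sketch assumes pandas is installed.&lt;/p&gt;

```python
import pandas as pd

# Hypothetical DataFrame used only for illustration.
df = pd.DataFrame({"height": [150, 160, 170], "weight": [50, 60, 70]})

# A single column converts to a one-dimensional array of shape (3,).
column = df["height"].to_numpy()

# reshape(-1, 1) turns it into a 2-D column vector; -1 lets NumPy infer the row count.
column_2d = column.reshape(-1, 1)
print(column_2d.shape)

# Selecting with a list of column names keeps the data two-dimensional from the start.
same_shape = df[["height"]].to_numpy()
print(same_shape.shape)
```

&lt;p&gt;Both approaches give an array with one column and one row per record, which is the layout that many scikit-learn estimators expect for their feature input.&lt;/p&gt;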


</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/608/how-to-reshape-in-pandas-dataframe?show=609#a609</guid>
<pubDate>Sat, 06 Apr 2019 21:12:13 +0000</pubDate>
</item>
<item>
<title>Answered: Is digital marketing and marketing internships worth it for a data science student?</title>
<link>https://ask.ghassem.com/606/digital-marketing-marketing-internships-science-student?show=607#a607</link>
<description>Depending on what you want to do, and what kind of methodologies you use for analyzing and reporting, an internship in this area could be considered as data science experience.</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/606/digital-marketing-marketing-internships-science-student?show=607#a607</guid>
<pubDate>Fri, 05 Apr 2019 00:38:05 +0000</pubDate>
</item>
<item>
<title>Answered: How to open Jupyter notebook files on Windows or Mac without web browser?</title>
<link>https://ask.ghassem.com/585/how-open-jupyter-notebook-files-windows-mac-without-browser?show=586#a586</link>
<description>&lt;p&gt;If you want a desktop application that opens Jupyter notebooks for viewing, printing and executing, &lt;strong&gt;nteract&lt;/strong&gt;&amp;nbsp;(&lt;a rel=&quot;nofollow&quot; href=&quot;https://nteract.io/desktop&quot;&gt;https://nteract.io/desktop&lt;/a&gt;) is here to serve you. In other words, it lets you open Jupyter notebooks in a standalone application instead of in the browser.&lt;/p&gt;

&lt;p&gt;However,&amp;nbsp;if you only need to preview notebooks without executing them, and you have a Mac, use this: &lt;a rel=&quot;nofollow&quot; href=&quot;https://github.com/tuxu/nbviewer-app&quot; target=&quot;_blank&quot;&gt;https://github.com/tuxu/nbviewer-app&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;&lt;a rel=&quot;nofollow&quot; href=&quot;https://cloud.githubusercontent.com/assets/836375/18421299/d95ad398-783b-11e6-8b23-d54cf7caad1e.png&quot;&gt;https://cloud.githubusercontent.com/assets/836375/18421299/d95ad398-783b-11e6-8b23-d54cf7caad1e.png&lt;/a&gt;&lt;/p&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/585/how-open-jupyter-notebook-files-windows-mac-without-browser?show=586#a586</guid>
<pubDate>Sat, 16 Mar 2019 23:38:07 +0000</pubDate>
</item>
<item>
<title>Answer selected: Could you please share a good cheat sheet for logic gates?</title>
<link>https://ask.ghassem.com/181/could-you-please-share-a-good-cheat-sheet-for-logic-gates?show=182#a182</link>
<description>&lt;p&gt;Please take a look at the following table, which contains the most common logic gates and their truth tables. You can download it by clicking on it.&lt;/p&gt;

&lt;p&gt;&lt;img alt=&quot;&quot; height=&quot;924&quot; src=&quot;https://i.imgur.com/NNBh9Bg.jpg&quot; width=&quot;564&quot;&gt;&lt;/p&gt;</description>
<category>Discrete Mathematics</category>
<guid isPermaLink="true">https://ask.ghassem.com/181/could-you-please-share-a-good-cheat-sheet-for-logic-gates?show=182#a182</guid>
<pubDate>Fri, 15 Mar 2019 17:38:23 +0000</pubDate>
</item>
<item>
<title>Passing variable length sentences to Tensorflow LSTM</title>
<link>https://ask.ghassem.com/561/passing-variable-length-sentences-to-tensorflow-lstm</link>
<description>&lt;p&gt;I have a TensorFlow LSTM model for predicting sentiment. I built the model with a maximum sequence length of 150 (the maximum number of words). To make predictions, I have written the code below:&lt;/p&gt;

&lt;pre class=&quot;prettyprint lang-python&quot; data-pbcklang=&quot;python&quot; data-pbcktabsize=&quot;4&quot;&gt;
batchSize = 32
maxSeqLength = 150

def getSentenceMatrix(sentence):
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;arr = np.zeros([batchSize, maxSeqLength])
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;sentenceMatrix = np.zeros([batchSize,maxSeqLength], dtype=&#039;int32&#039;)
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;cleanedSentence = cleanSentences(sentence)
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;cleanedSentence = &#039; &#039;.join(cleanedSentence.split()[:150])
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;split = cleanedSentence.split()
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;for indexCounter,word in enumerate(split):
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;try:
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;sentenceMatrix[0,indexCounter] = wordsList.index(word)
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;except ValueError:
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;sentenceMatrix[0,indexCounter] = 399999 #Vector for unknown words
&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;return sentenceMatrix

input_text = &quot;example data&quot;
inputMatrix = getSentenceMatrix(input_text)&lt;/pre&gt;

&lt;p&gt;&lt;br&gt;
&lt;br&gt;
In this code I&#039;m truncating my input text to 150 words and ignoring the remaining data. Because of this, my predictions are wrong.&lt;/p&gt;

&lt;pre class=&quot;prettyprint lang-python&quot; data-pbcklang=&quot;python&quot; data-pbcktabsize=&quot;4&quot;&gt;
cleanedSentence = &#039; &#039;.join(cleanedSentence.split()[:150]) &lt;/pre&gt;

&lt;p&gt;&lt;br&gt;
I know that if the input is shorter than the sequence length, we can pad it with zeros. What should we do if it is longer? Can you suggest the best way to handle this? Thanks in advance.&lt;/p&gt;</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/561/passing-variable-length-sentences-to-tensorflow-lstm</guid>
<pubDate>Mon, 11 Feb 2019 05:06:27 +0000</pubDate>
</item>
<item>
<title>Answered: What is degree of Freedom while calculating confidence interval?</title>
<link>https://ask.ghassem.com/544/what-degree-freedom-while-calculating-confidence-interval?show=558#a558</link>
<description>&lt;p&gt;The degrees of freedom are the number of values that are free to vary in the computation of a statistic. For more information, please take a look at &lt;a rel=&quot;nofollow&quot; href=&quot;http://www.jerrydallal.com/LHSP/dof.htm&quot;&gt;this article&lt;/a&gt;.&amp;nbsp;&lt;/p&gt;</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/544/what-degree-freedom-while-calculating-confidence-interval?show=558#a558</guid>
<pubDate>Fri, 28 Dec 2018 16:06:55 +0000</pubDate>
</item>
<item>
<title>Answered: What is the easiest way to distinguish whether to use a z value test or a t value test?</title>
<link>https://ask.ghassem.com/553/what-the-easiest-distinguish-whether-value-test-value-test?show=557#a557</link>
<description>&lt;p&gt;This is already answered &lt;a rel=&quot;nofollow&quot; href=&quot;https://askdatascience.com/537/know-when-distribution-instead-regular-confidence-interval?show=538#a538&quot;&gt;here&lt;/a&gt;. Most of the time we do not know the parameters of the population distribution, so the t-test is more practical.&lt;/p&gt;</description>
<category>Data Science</category>
<guid isPermaLink="true">https://ask.ghassem.com/553/what-the-easiest-distinguish-whether-value-test-value-test?show=557#a557</guid>
<pubDate>Fri, 28 Dec 2018 16:03:46 +0000</pubDate>
</item>
<item>
<title>Answered: How to find the strength of a P-value against a null hypothesis?</title>
<link>https://ask.ghassem.com/545/how-to-find-the-strength-of-a-p-value-against-null-hypothesis?show=556#a556</link>
<description>P &amp;gt; 0.10 &amp;nbsp;&amp;nbsp;--------------- &amp;nbsp;&amp;nbsp;Weak or None&lt;br /&gt;
&lt;br /&gt;
0.05 &amp;lt; P &amp;lt;= 0.10 --------------- &amp;nbsp;Moderate&lt;br /&gt;
&lt;br /&gt;
0.01 &amp;lt; P &amp;lt;= 0.05 ---------------- &amp;nbsp;Strong&lt;br /&gt;
&lt;br /&gt;
P &amp;lt;= 0.01 ---------------------- Very Strong</description>
<category>Statistics</category>
<guid isPermaLink="true">https://ask.ghassem.com/545/how-to-find-the-strength-of-a-p-value-against-null-hypothesis?show=556#a556</guid>
<pubDate>Fri, 28 Dec 2018 16:01:36 +0000</pubDate>
</item>
</channel>
</rss>