Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

session 003 - LSTM IMDB Sentiment Example Review from Colab #21

Open
John-Cai-ds opened this issue Nov 28, 2021 · 3 comments
Open

session 003 - LSTM IMDB Sentiment Example Review from Colab #21

John-Cai-ds opened this issue Nov 28, 2021 · 3 comments

Comments

@John-Cai-ds
Copy link
Collaborator

John-Cai-ds commented Nov 28, 2021

Why: A time series is a series of data points in time order, which is a sequence with equally spaced points in time. Examples of time series are price of tickets, price of hotel room, daily stock price, etc. It has been widely used in pattern recognition, weather forecasting, earthquake predition, and in any science and enginering which are related to temporal measurements.
Date: 12/04/2021 TBD Skype: https://join.skype.com/WzRQJuTDFrMe
host: @John-Cai-ds
facilitator: @stanghong
https://app.reviewnb.com/suredream/hack-your-ds-interview/blob/main/notebook%2FLSTM_IMDB_Sentiment_Example.ipynb

@suredream suredream changed the title LSTM IMDB Sentiment Example Review from Colab session 003 - LSTM IMDB Sentiment Example Review from Colab Nov 29, 2021
@MeihZ
Copy link

MeihZ commented Dec 4, 2021

非常感谢分享,希望下周可以深入讨论一下:
1) LSTM的应用领域(主要解决什么问题,什么样的data适用)
2.) 实际应用中与其他model的对比
3)How to tune the model? Metrics?
4) in general , RNN application

@MeihZ
Copy link

MeihZ commented Dec 4, 2021

概述一下目前time series 的model 选择和对比

@stanghong
Copy link
Collaborator

stanghong commented Dec 5, 2021

Summary
LSTM比传统classification方法效果要好,但是对计算资源要求较高
处理中TFIDF+RF/XGBOOST有可解释性,bert/training size大,embedding好些,distilbert更会好些
NLP在医疗,保险中的应用很有前景(参考视频录影),LSTM在无人驾驶,voice control中有应用前景
Discrete timesteps, vanishing的问题LSTIM会很好的解决

Questions
1) LSTM的应用领域(主要解决什么问题,什么样的data适用)
2) 实际应用中与其他model的对比
3)How to tune the model? Metrics?
4) in general , RNN application
5) NLP model drifting问题:比如twitter出现新的词汇在traindata里不存在如何处理?
6) 问题notebook里50/50的split,和其他算法不一样,有没有什么样的split guidance?
7) Stopping word处理有没有什么讲究?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants