Tensorflow(@CVision)

#مقاله #سورس_کد
The Universal Sentence Encoder is now in TensorFlow.js! USE gives you embeddings for text. It's a great starting point for doing sentiment analysis, topic classification, measuring textual similarity, etc.

https://github.com/tensorflow/tfjs-models/tree/master/universal-sentence-encoder

#nlp #tensorflow

1.4K viewsAlireza Akhavan, edited 05:50

This media is not supported in your browser

VIEW IN TELEGRAM

Text classification using TensorFlow.js: An example of detecting offensive language in browser.

https://medium.com/tensorflow/text-classification-using-tensorflow-js-an-example-of-detecting-offensive-language-in-browser-e2b94e3565ce

#tensorflow_js #nlp #text_classification

2.1K viewsAlireza Akhavan, 11:09

Tensorflow(@CVision)

#سورس_کد
Neural Machine Translation with Luong Attention - Tensorflow 2.0

https://github.com/tensorflow/examples/blob/master/community/en/nmt_with_luong_attention.ipynb

#tensorflow2 #nlp #rnn #attention

2.6K viewsAlireza Akhavan, 19:32

Tensorflow(@CVision)

This media is not supported in your browser

VIEW IN TELEGRAM

این دموی completion آنلاین را با GPT-2 پیاده سازی کردند...
https://transformer.huggingface.co/

مطالب مرتبط:
https://t.me/cvision/1026
https://t.me/cvision/1035

#Transformer #NLP #GPT2

2.2K viewsAlireza Akhavan, edited 15:24

Tensorflow(@CVision)

Language, trees, and geometry in neural networks

A visualization technique to understand BERT.
https://twitter.com/burkov/status/1139391818443808769

#bert #NLP

Twitter

Andriy Burkov

A visualization technique to understand BERT. https://t.co/kP5QPBcdlG

1.9K viewsAlireza Akhavan, 16:53

Tensorflow(@CVision)

#مقاله #پیاده_سازی
XLNet: Generalized Autoregressive Pretraining for Language Understanding

#XLNet outperforms #BERT on 20 tasks, often by a large margin, and achieves state-of-the-art results on 18 tasks including question answering, natural language inference, sentiment analysis, and document ranking. Code and comparisons here:

سورس کد (تنسرفلو)
https://github.com/zihangdai/xlnet

مقاله
https://arxiv.org/abs/1906.08237v1

#NLP

GitHub

GitHub - zihangdai/xlnet: XLNet: Generalized Autoregressive Pretraining for Language Understanding

XLNet: Generalized Autoregressive Pretraining for Language Understanding - zihangdai/xlnet

1.7K viewsAlireza Akhavan, 12:57

Tensorflow(@CVision)

2.4K viewsAlireza Akhavan, 12:57

Tensorflow(@CVision)

طبقه بندی متن با استفاده از Flair

Flair delivers state-of-the-art performance in solving NLP problems such as named entity recognition (NER), part-of-speech tagging (PoS), sense disambiguation and text classification. It’s an NLP framework built on top of PyTorch.

لینک پست: https://towardsdatascience.com/text-classification-with-state-of-the-art-nlp-library-flair-b541d7add21f
کد: https://github.com/zalandoresearch/flair

#text_classification #nlp #flair

Medium

Text Classification with State of the Art NLP Library — Flair

Exciting news! A new version of Flair - state-of-the-art NLP library has just been released. Learn how to use it for text classification

2.7K viewsVahid Reza Khazaie, 05:26

Tensorflow(@CVision)

#خبر
ابزاری برای کمک به تکمیل کد در حین برنامه‌نویسی (code autocompletion) مبتنی بر مدل GPT-2 آموزش داده شده روی حدود ۲ میلیون فایل از گیت‌هاب:
https://tabnine.com/blog/deep?utm_campaign=NLP%20News&utm_medium=email&utm_source=Revue%20newsletter

مرتبط با:
https://t.me/cvision/1035
https://t.me/cvision/1026
🙏Thanks to: @Machine_Learnings

#nlp

3.0K viewsAlireza Akhavan, edited 17:35

Tensorflow(@CVision)

آخرین ریلیز Hugging Face در حوزه #NLP

به روز ترین معماری های عام منظوره مانند (BERT, GPT-2, RoBERTa, XLM, DistilBert, XLNet...) برای درک زبان طبیعی (NLU) و تولید زبان طبیعی (NLG) برای بیش از 32 مدل از پیش آموزش داده شده و برای بیش از 100 زبان با قابلیت جابه‌جایی بین دو فریم‌ورک TensorFlow 2.0 و PyTorch

Code: https://github.com/huggingface/transformers

GitHub

GitHub - huggingface/transformers: 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX. - huggingface/transformers

3.8K viewsVahid, 11:58

Tensorflow(@CVision)

کد و وزن های مدل زبانی 1.5 میلیارد پارامتری GPT-2 منتشر شد...

OpenAI announced the final staged release of its 1.5 billion parameter language model GPT-2, along with all associated code and model weights

لینک خبر:
https://twitter.com/OpenAI/status/1191764001434173440
لینک بلاگ پست:
https://medium.com/syncedreview/openai-releases-1-5-billion-parameter-gpt-2-model-c34e97da56c0

#language_model #gpt2 #nlp #openai

3.0K viewsVahid Reza Khazaie, 06:27

Tensorflow(@CVision)

This media is not supported in your browser

VIEW IN TELEGRAM

با این مدل آموزش داده شده توسط BERT در مرورگر سوال مفهومی بپرسید!

Exploring helpful uses for BERT in your browser with Tensorflow.js

https://blog.tensorflow.org/2020/03/exploring-helpful-uses-for-bert-in-your-browser-tensorflow-js.html?linkId=84231860

#NLP #tfjs

3.6K viewsAlireza Akhavan, edited 18:09

Tensorflow(@CVision)

#nlp #مجموعه_داده
Persian raw text - حدود ۸۰ گیگابایت متن خام فارسی
https://github.com/danyaljj/persian_raw_text

🙏Thanks to: @Sed_Amin_Taheri

GitHub

GitHub - danyaljj/persian_raw_text: Persian raw text

Persian raw text . Contribute to danyaljj/persian_raw_text development by creating an account on GitHub.

2.8K viewsAlireza Akhavan, 19:32

Tensorflow(@CVision)

#مجموعه_داده شامل متن #فارسی

Wiki-40B: Multilingual Language Model Dataset

https://research.google/pubs/pub49029/

#NLP

2.2K viewsAlireza Akhavan, 20:41

Tensorflow(@CVision)

#مجموعه_داده #nlp
مجموعه دادهٔ بزرگ فارسی محاوره‌ای

- شامل ۱۲۰ میلیون جمله فارسی محاوره‌ای از ۲۷ میلیون توئیت
- به همراه درخت اشتقاق، برچسب‌گذاری دستوری (جزء کلام)، قطبیت احساسات و ترجمه هر جمله در زبان انگلیسی، آلمانی، چکی، ایتالیایی و هندی

سایت این پروژه:
https://iasbs.ac.ir/~ansari/lscp/
همچنین شما می‌توانید از http://hdl.handle.net/11234/1-3195 دانلود کنید.

در گوگل کولب نگاهی به این مجموعه داده و پیش‌پردازش های مقدماتی آن بیندازید:
https://colab.research.google.com/drive/1VoY2BjOdI0thx6bpOK5zAxQiP0ry4V70

🙏Thanks to: Alireza Asadi
@cvision

iasbs.ac.ir

LSCP: Enhanced Large Scale Colloquial Persian Language Understanding

4.3K viewsAlireza Akhavan, edited 17:37

Tensorflow(@CVision)

مقاله و کد مدل زبانی GPT-3 منتشر شد!

GPT-3 is an autoregressive language model and trained with 175 billion parameters, 10x more than any previous non-sparse language model before.

Language Models are Few-Shot Learners: https://arxiv.org/abs/2005.14165

GitHub: https://github.com/openai/gpt-3

GPT-3 achieves strong performance on many NLP datasets, including translation, question-answering, and cloze tasks as well as several tasks that require on-the-fly reasoning or domain adaptation, such as unscrambling words, using a novel word in a sentence, or performing 3-digit arithmetics.

#nlp #gpt3

GitHub

GitHub - openai/gpt-3: GPT-3: Language Models are Few-Shot Learners

GPT-3: Language Models are Few-Shot Learners. Contribute to openai/gpt-3 development by creating an account on GitHub.

2.8K viewsVahid Reza Khazaie, 08:53

Tensorflow(@CVision)

آمازون (Amazon)‌ خبر داده چندتا از کورس‌های یادگیری ماشین خودش که برای آموزش کارکنان‌شون تهیه شده‌بودن رو برای استفاده عمومی در اختیار گذاشته.

https://www.amazon.science/latest-news/machine-learning-course-free-online-from-amazon-machine-learning-university

این آموزش‌ها که در سه بخش بینایی‌ماشین، پردازش زبان طبیعی و کار با جداول داده هستند رو می‌تونید در صفحه یوتیوب Amazon's Machine Learning University در آدرس زیر مشاهده کنید:

https://www.youtube.com/channel/UC12LqyqTQYbXatYS9AA7Nuw

#coruse #free #NLP #computer_vision #ML

Amazon Science

Amazon's Machine Learning University is making its online courses available to the public

Classes previously only available to Amazon employees will now be available to the community.

5.7K viewsMohammad H. Sattarian, edited 16:14

Tensorflow(@CVision)

#مجموعه_داده #دیتاست #nlp #فارسی
پیکره‌ متنی ناب.

این پیکره مجموعه پاک‌سازی شده و قابل استفاده مستقیم برای محققان حوزه پردازش زبان طبیعی در فارسی است. این مجموعه شامل حدود ۱۳۰ گیگابایت دیتا متنی شامل ۲۵۰ میلیون پاراگراف و ۱۵ میلیارد کلمه است.
همچنین نسخه خام ناب به همراه اسکریپت پیش پردازش (استفاده شده برای تمیز‌سازی داده) در اختیار عموم قرار گرفته تا به کمک آن بتوانید نسخه تمیز شده‌ پیکره خود را بسازید.
از این پیکره می‌توان برای fine-tune کردن مدل‌های زبانی - که در اصل برای زبان انگلیسی تهیه شده‌اند - برای زبان فارسی نیز استفاده کرد. از جمله این مدل‌های زبانی می‌توان به BERT, BART, T5 و ... اشاره کرد.

+ https://arxiv.org/abs/2208.13486
+ https://huggingface.co/datasets/SLPL/naab
+ https://huggingface.co/datasets/SLPL/naab-raw

@cvision

huggingface.co

SLPL/naab-raw · Datasets at Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

2.9K viewsAlireza Akhavan, edited 05:37

Tensorflow(@CVision)

Stanford University NLP With Deep learning course, Winter 2023 is finally out with notes & hands-on code to practice, do check the link 👇

https://web.stanford.edu/class/cs224n/index.html?utm_source=substack&utm_medium=email#schedule

#منبع #سورس_کد
#nlp #deeplearning #ai

6.1K viewsAlireza Akhavan, 11:09

Tensorflow(@CVision)

New NLP course by Stanford

Stanford University released yesterday a new course -
Intro & Evolution of Natural Language Understanding
(#XCS224U), taught by Prof. Christopher Potts.
As the name implies, the course focuses on core NLP concepts and theory, natural language processing, and machine learning topics. That includes
Domain adaptation for supervised sentiment Retrieval augmented in-context learning
Advanced behavioral evaluation
Analysis methods #NLP methods

https://www.youtube.com/playlist?list=PLoROMvodv4rOwvldxftJTmoR3kRcWkJBp

5.8K viewsAlireza Akhavan, 06:51

About

Blog

Apps

Platform