WEBVTT

00:00.390 --> 00:01.270
All right, everyone.

00:01.770 --> 00:05.670
So the next topic, which we are going to learn is stop words.

00:06.360 --> 00:13.000
So in English language or any other language which is being created by these few months, mostly how

00:13.000 --> 00:15.810
such a kind of stop was so technically.

00:15.840 --> 00:18.090
What does this top word here mean?

00:18.720 --> 00:28.410
So let's say in English, there are many times we keep using terms like a can, let's say like maybe

00:28.770 --> 00:29.400
always.

00:30.420 --> 00:36.030
So those term from the grammatical point of view, they have their own meaning.

00:36.480 --> 00:37.990
They convey something.

00:38.400 --> 00:46.380
But from the technical analysis point of view, it doesn't carry much information for any kind of interpretation.

00:46.950 --> 00:51.120
So it is always good practice to remove such a kind of stopwork.

00:51.630 --> 00:56.700
And this P.C. will provide us a list of such a kind of top.

00:57.270 --> 00:59.670
So let me lower the spacy library.

00:59.700 --> 01:06.150
And in the model, the core of a small size model, we are going to load it.

01:06.600 --> 01:12.360
So inside this MLP, there is a number of stoppers already defined through this model.

01:12.390 --> 01:17.300
So if you want to grab it, we can use like an LP.

01:18.420 --> 01:22.370
Let's say device stoppers.

01:22.770 --> 01:23.880
And if your security.

01:25.210 --> 01:27.360
You'll will be able to see the words like no.

01:28.100 --> 01:29.690
Then this is like a positive.

01:30.260 --> 01:35.660
Nobody has still and these kinds of number of words are available.

01:36.080 --> 01:39.980
Which is considered by this particular model is a stop was.

01:41.840 --> 01:43.520
Let's try to find a laptop.

01:43.580 --> 01:45.290
How many staffers exist?

01:45.740 --> 01:51.050
So what we can do instead of print, we can apply on a land function.

01:51.350 --> 01:57.590
So there are total three hundred and twenty six staffers are really well based on which particular model,

01:57.590 --> 01:58.240
Veoh Ludic.

01:58.580 --> 02:04.370
Now, if you look at some other models, like a medium size model or a large model, you may get different

02:04.370 --> 02:05.240
numbers also here.

02:05.540 --> 02:10.760
Now, suppose some work if you want to know whether it's stopwork or not.

02:11.540 --> 02:13.790
So far that you can use like an LP.

02:15.000 --> 02:21.900
Not woke up and here I can pass, like let's say we're like Poly's.

02:23.150 --> 02:30.550
And I can apply these underscore stop attribute, and it will return me through.

02:30.590 --> 02:32.480
That means there's always a tribute.

02:32.750 --> 02:34.610
He's a star for.

02:36.390 --> 02:39.810
Let's try something else so noun's it off.

02:40.170 --> 02:46.170
Let's see some other words like let the finance and security.

02:47.950 --> 02:48.870
It has it can force.

02:48.940 --> 02:53.080
That means this finance he would is not a topless.

02:54.330 --> 03:00.210
Now, this is already pre-defined, Microsoft Stock Awards are available based on this model support

03:00.300 --> 03:06.780
inside this model, if you want to add some keyword has a stop or if you want to remove some keyword

03:06.810 --> 03:09.240
as a stopwork, how you can do it.

03:10.090 --> 03:10.360
Whoops.

03:12.120 --> 03:13.830
Let's say some arbitrary.

03:15.040 --> 03:19.740
He SPF hopes and I want to find whether it's a stopwork or not.

03:20.030 --> 03:22.100
So it's not a stoplight support.

03:22.370 --> 03:28.970
This is the if I just want to add it to my stoppered list, I can just simply use an LP.

03:29.540 --> 03:34.690
Not before, not stoppers.

03:35.060 --> 03:36.620
And I can just simply apply.

03:37.230 --> 03:41.610
Had I met her in here, let me just posit two is D.F..

03:42.380 --> 03:44.690
Now one more word got added to my.

03:45.700 --> 03:46.630
Stopwork wordlist.

03:48.320 --> 03:51.210
And suppose the same court.

03:51.620 --> 03:55.290
Whoops, I should have kept it here if I executed.

03:56.840 --> 03:58.580
And you can see still it's false.

03:58.970 --> 04:00.020
So what we can do?

04:00.080 --> 04:02.390
We can just make it like up through.

04:03.050 --> 04:05.480
And let me execute this part again.

04:05.870 --> 04:08.890
Now, you will be able to see one more word like Harry.

04:09.440 --> 04:10.280
Now what we can do.

04:10.520 --> 04:11.540
Earlier, we got that.

04:12.020 --> 04:14.660
We are total 308, 26 stopwork.

04:15.050 --> 04:21.840
So if we just execute this land function again, now we have a one more words has been added to stop,

04:21.860 --> 04:25.730
or at least that is a three hundred and twenty seven words out of 11.

04:27.130 --> 04:33.800
Now, suppose inside this whole vocabulary of this stopwork, some more, I just want to remove it.

04:34.450 --> 04:37.480
So let's say no, I just don't want to consider this.

04:37.510 --> 04:39.330
No Kiewa as a stopwatch.

04:39.760 --> 04:45.160
So what I can do NLB not, let's say, walk up in place a play.

04:45.220 --> 04:46.480
No, it will.

04:47.740 --> 04:51.370
I need to apply this ease and underscore stop Petrovitch.

04:52.420 --> 04:53.780
And it has written me through.

04:53.820 --> 04:55.870
That means this is stopwork.

04:56.350 --> 04:59.650
Let's just make this has a false.

04:59.920 --> 05:03.400
And now if you just try to display an LP.

05:04.870 --> 05:09.190
Not let's for not stop where list.

05:10.740 --> 05:12.390
So you can see.

05:13.910 --> 05:18.290
There is no no, I think I should print better option.

05:18.610 --> 05:23.090
All right, so you can see the first ever no appears.

05:25.040 --> 05:28.010
So what we can do is just like antimatter.

05:28.310 --> 05:30.950
We can just simply use an LP.

05:32.950 --> 05:38.220
Not be false, dark stop was not removed.

05:38.570 --> 05:39.750
And let just make it.

05:40.100 --> 05:40.520
No.

05:40.870 --> 05:42.410
And again, we are to execute.

05:42.440 --> 05:44.240
This has a false.

05:46.150 --> 05:54.440
Now, if you bring all those top world list again, now you be a legacy that is no exist now.

05:55.360 --> 05:55.720
All right.

05:55.750 --> 05:58.470
So there is the whole story behind StockTwits.

05:59.080 --> 06:01.900
What other divorced Topo's had a really well based on different model.

06:01.930 --> 06:03.970
You will get a different stockhorse.

06:04.360 --> 06:10.060
And if you want to add your custom, stop, or if you want to remove some stop or you don't want to

06:10.060 --> 06:12.630
consider those particular keywords as a stopper.

06:12.910 --> 06:14.430
You can just simply adding order.

06:15.280 --> 06:15.600
All right.

06:15.970 --> 06:17.620
So that is all about the stopwork.

06:17.740 --> 06:18.980
See you in the next video.
