WEBVTT

00:01.210 --> 00:09.400
All right, so next thing you know, tokenization we will see in this video are the slicing and indexing

00:09.510 --> 00:10.910
into open eyes.

00:11.590 --> 00:17.860
So just like indexing and slicing, we have learned, you know, string class in a patent.

00:17.940 --> 00:21.490
So that's very much basic about the string classes.

00:21.760 --> 00:25.630
So same way you can apply all those slicing and indexing.

00:25.690 --> 00:27.550
You know, knock objects off.

00:27.550 --> 00:32.880
So if you see here this top object, if I just display not four.

00:33.250 --> 00:36.530
So you'll be able to see V, how to grab the type of this.

00:37.360 --> 00:42.250
Otherwise, it will just display those string only whatever we have supplied.

00:43.090 --> 00:46.910
And it is a kind of dark object, spacy Hawkins Dark Dot.

00:47.360 --> 00:52.120
Now let's see if we want to grab how many tokens are dead inside this dark four object.

00:52.630 --> 00:54.370
We can just simply use land function.

00:54.460 --> 00:55.690
That is a will.

00:55.720 --> 00:56.850
Functions are available now.

00:58.260 --> 00:59.030
Let's say, OK.

00:59.650 --> 01:00.540
And that is a seven.

01:00.600 --> 01:05.230
That means that this particular sentence has been divided into seven tokens.

01:05.800 --> 01:13.330
Let's say if you want to grab the very first token, we can use like a doc for hand, just like we indexing

01:13.330 --> 01:15.520
works for a string class and a list.

01:15.520 --> 01:18.920
Plus it will work on this and the class also.

01:19.240 --> 01:21.860
So first broken can be grab with this indexing.

01:22.010 --> 01:22.390
Zero.

01:23.350 --> 01:25.120
So you can see the first one is like.

01:26.310 --> 01:29.580
We can hear even do the slicing also.

01:30.240 --> 01:37.490
So, like for let's say I want to grab from watch the movie, watch out.

01:37.750 --> 01:40.910
And so this is referring to zero, then one.

01:41.290 --> 01:43.410
So from two, three, four and five.

01:43.740 --> 01:44.560
So two to five.

01:44.610 --> 01:45.270
I can grab it.

01:45.870 --> 01:47.460
Upper bound is all of this textually.

01:48.540 --> 01:52.780
So now you can see what up and moving all the tokens.

01:52.920 --> 01:57.840
It has written based on this indexing value here, the negative index also works.

01:58.170 --> 02:00.420
So you can use like a minus one.

02:00.720 --> 02:05.760
So the very last token return nine not case, the last option is not.

02:05.850 --> 02:06.870
So it does return this.

02:08.700 --> 02:15.720
But you just cannot assigned a new convert because it's like a final thing, you just cannot modify.

02:16.260 --> 02:19.890
So let's say dock four zero.

02:20.190 --> 02:22.560
We already know dock four zero zero.

02:24.490 --> 02:29.530
Like, let's say you fight, right, to assign it to something as, let's say some new.

02:31.870 --> 02:37.150
So immediately we got there that this type of object doesn't support the item assignment.

02:37.240 --> 02:39.880
That means you just cannot publicly cooperate.

02:40.000 --> 02:43.810
So that is all about the tokenization basics.

02:44.180 --> 02:49.670
In the next video, we'll see about some more stuff related to an LP basic technique.
