LOADING

加载过慢请开启缓存,浏览器默认开启

Waiting for the dawn

interview shit network

interview 2023/3/19

interview network part

阅读全文

interview shit- algorithm

2023/3/19

interview algorithm part

阅读全文

C++ -interview STL

2023/3/18

C++ review ptr&stl part

阅读全文

C++ -interview ptr&cast

2023/3/17

C++ review ptr&stl part

阅读全文

C++ interview- key words part

2023/3/16

C++ review key words part

阅读全文

lateral support

2025/3/30
阅读全文

勇敢一点

2025/3/12
阅读全文

勇敢一点

2025/3/9

我真的好害怕,心里老是充满着苦涩,我不知道我能不能成功,如果一无所有会怎么办,🐶和❄️加油了,搞完继续做DF的lateral,我搞不清楚我想要什么了,只能硬着头皮向前走去.

阅读全文

勇敢一点

2025/3/3

加油加油!

阅读全文

qdrant

2025/2/2

how to get embeddings

1. tokenizer

we need a bunch of tokens in order to process via vectordb

for example for a sentence like

I like playing counter strike

first we need to get some tokenizer(In this case BertTokenizer)

first we generate a dictionary using WordPieces

the dictionary contains

[I, like, playing, counter, strike]

for every word which is inside the dictionary, we directly use it, otherwise we split it into even smallar part

for

unlike

we generate something like

["un##", "like"]

we also have some special tokens

[[CLS], [SEP], [PAD]]

Then we generate a map from word to ID

also combine a mask which have 0/1 indicates whether its a PAD or a word

阅读全文
1 2 3 4 5 ... 30
avatar
Yanxin Xiang

愿有一天能和你最重要的人再次相逢