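I have a DataFrame with a single column of strings. I am trying to find (a) the maximum number of words in the column and (b) the rows containing the strings with that maximum number of words.
This is what I tried: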
import pandas as pd
something = ["Hello how are you", "I am doing great", "Lets go camping"]
test = pd.DataFrame(something)
test.columns = ["Response"]
length_of_the_messages = test["Response"].str.split("\\s+")
print(length_of_the_messages)
print(length_of_the_messages.len().max())
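But this raises an error saying that the Series object has no attribute len. How can I get both the maximum number of words among the strings in the column and the index of the corresponding row?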
Best answer
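A Series has no len method of its own, but you can use the .str accessor (.str.len()) to get the length of each list, and .idxmax to get the index: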
import pandas as pd
something = ["Hello how are you", "I am doing great", "Lets go camping"]
test = pd.DataFrame(something)
test.columns = ["Response"]
length_of_the_messages = test["Response"].str.split("\\s+")
print(length_of_the_messages)
print("Max number of words = ", length_of_the_messages.str.len().max())
print("Index = ", length_of_the_messages.str.len().idxmax())
This prints:
0    [Hello, how, are, you]
1     [I, am, doing, great]
2       [Lets, go, camping]
Name: Response, dtype: object
Max number of words =  4
Index =  0
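Note that in the sample data rows 0 and 1 both contain four words, and idxmax returns only the first index at which the maximum occurs. If every row with the maximum is wanted, as part (b) of the question suggests, one possible approach is to compare the word counts against the maximum and use the result as a boolean mask. The following is a minimal sketch under that assumption; the names word_counts, max_words and rows_with_max are illustrative and not part of the original answer:

```python
import pandas as pd

test = pd.DataFrame(
    {"Response": ["Hello how are you", "I am doing great", "Lets go camping"]}
)

# With no pattern, str.split() splits on any run of whitespace;
# .str.len() then gives the number of words in each row.
word_counts = test["Response"].str.split().str.len()

max_words = word_counts.max()

# The boolean mask keeps every row whose count equals the maximum,
# not only the first one that idxmax would report.
rows_with_max = test[word_counts == max_words]

print("Max number of words =", max_words)
print("Row indices =", rows_with_max.index.tolist())
```

For this sample the sketch should report a maximum of 4 and the row indices [0, 1].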
https://stackoverflow.com/questions/67927014/