python常用pandas函数nlargest / nsmallest及其手动实现

65 阅读 0 评论 0 点赞

在数据分析中，获取数据集中最大或最小的几个值是一个非常常见的需求。在Python的Pandas库中，nlargest和nsmallest函数提供了方便的方法来实现这个功能。本文将深入介绍这两个函数，并提供手动实现的方法。

`nlargest` 和 `nsmallest` 函数

这两个函数的主要作用是从一个DataFrame或Series中返回指定数量的最大值或最小值。在具体的应用中，它们支持对任意列进行排序，并返回所需的行数。

`nlargest` 函数

用法：

DataFrame.nlargest(n, columns, keep='first')

n：要返回的最大值的数量。
columns：用于排序的列名。
keep：用于处理重复值的参数，默认为'first'，可选值包括'first'、'last'和False。

示例：

import pandas as pd

# 创建示例数据
data = {
    'name': ['Alice', 'Bob', 'Charlie', 'David', 'Edward'],
    'score': [85, 92, 78, 90, 88]
}
df = pd.DataFrame(data)

# 获取最高的3个分数
top_scores = df.nlargest(3, 'score')
print(top_scores)

输出：

      name  score
1      Bob     92
3    David     90
4  Edward     88

`nsmallest` 函数

用法：

DataFrame.nsmallest(n, columns, keep='first')

与nlargest的用法几乎相同，只不过返回的是最小的几个值。

示例：

# 获取最低的2个分数
bottom_scores = df.nsmallest(2, 'score')
print(bottom_scores)

输出：

      name  score
2  Charlie     78
0    Alice     85

手动实现

虽然Pandas提供了nlargest和nsmallest函数，但在某些情况下，我们可能需要手动实现同样的功能。这里我们使用Python的内置函数sorted来实现这一过程。

手动实现 `nlargest`

我们可以通过对数据的score列进行排序，然后切片来获取最大的N个值。

def manual_nlargest(df, n, column):
    sorted_df = df.sort_values(by=column, ascending=False)
    return sorted_df.head(n)

# 使用手动实现的nlargest函数
top_scores_manual = manual_nlargest(df, 3, 'score')
print(top_scores_manual)

手动实现 `nsmallest`

同理，获取最小的N个值也可以通过排序来实现。

def manual_nsmallest(df, n, column):
    sorted_df = df.sort_values(by=column, ascending=True)
    return sorted_df.head(n)

# 使用手动实现的nsmallest函数
bottom_scores_manual = manual_nsmallest(df, 2, 'score')
print(bottom_scores_manual)

总结

nlargest和nsmallest函数是Pandas库中非常实用的工具，让我们能轻松地获取数据集中的最大或最小值。即使在没有这些函数的情况下，手动实现也是可行的，虽然可能效率稍低。了解这些函数的底层实现，有助于我们更深入地理解数据处理的各个方面，并能够根据需要进行自定义。无论你是在进行数据科学研究，还是在处理商业数据，掌握这类技术都是非常重要的。

点赞(0) 打赏

本文分类：后端
本文标签：python Python pandas
浏览次数：65 次浏览
发布日期：2024-10-17 08:43:36
本文链接：http://makehui.com/houduan/6760.html

上一篇 > 简单说说Python中的pip如何换源
下一篇 > 教你用 Python 自制简单版《我的世界》

python常用pandas函数nlargest / nsmallest及其手动实现

nlargest 和 nsmallest 函数

nlargest 函数

nsmallest 函数

手动实现

手动实现 nlargest

手动实现 nsmallest

总结

微信扫一扫：分享

【Py/Java/C++三种语言OD独家2024E卷真题】20天拿下华为OD笔试之【模拟】2024E-转骰子【欧弟算法】全网注释最详细分类最全的华为OD真题题解

初级爬虫实战——巴黎圣母院新闻

Python 3.12 环境搭建（Windows版）

Python 列表全方位解析：创建、操作、删除与遍历的全面指南