论文栏

字数统计: 1.7k字 | 阅读时长: 7分

2022-07-24

持续更新38, 挖坑61, 论文17

摘要: 本文记录历史经典论文，附本站链接

【对数据分析、人工智能、金融科技、风控服务感兴趣的同学，欢迎关注我哈，阅读更多原创文章】
我的网站：潮汐朝夕的生活实验室
我的公众号：潮汐朝夕
我的知乎：潮汐朝夕
我的github：FennelDumplings
我的leetcode：FennelDumplings

数学

名称	作者	年份	备注	相关文章
《通信的数学理论》	香农	1948	开创信息论	-
《The Theory of Dynamic Programming》	Bellman	1954	奠定动态规划的理论基础	-
[《On the Hypotheses which lie at the Base of Geometry》]	黎曼	1854	开创黎曼几何	-
[《NON-COOPERATIVE GAMES 》]	纳什	1950	奠定非合作博弈理论基础	-

经济金融

名称	作者	年份	备注	相关文章
《Statistical-mechanics-of-money-wealth-and-income》	V.M. Yakovenko	2009	参考统计力学，提出指数型分布在货币、财富和收入中的普遍性	-

大数据系统

名称	作者	年份	备注	相关文章
《MapReduce: Simplified Data Processing on Large Clusters》；中文版	Google	2004	提出MapReduce	-
《Bigtable: A Distributed Storage System for Structured Data》；中文版	Google	2006	分布式结构化数据存储系统	-
《The Google File System》；中文版	Google	2003	分布式文件系统设计	-
《F1: A Distributed SQL Database That Scales》	Google	2013	TPAP数据库	-

数据结构

名称	作者	年份	备注	相关文章
《skip lists: a probabilistic alternative to balanced trees》	william pugh	1990	提出跳表	跳表
《A Fast Algorithm for Finding Dominators in a Flowgraph》	Tarjan	1979	提出在有向图中寻找支配者的算法	有向图的必经点,支配树

算法

名称	作者	年份	备注	相关文章
《Fast Pattern Matching in Strings》	Kruth, Morris, Pratt	1977	提出字符串精确匹配的 KMP 算法	KMP算法与代码模板
《A Fast String Searching Algorithm》	Boyer, Moore	1977	提出字符串精确匹配的 BM 算法	字符串精确匹配的BM算法
《A Very Fast Subtring Search Algorithm》	Sunday	1990	提出 BM 算法的变种 Sunday 算法	字符串精确匹配BM算法的变种：Sunday算法
《Practical Fast Searching in String》	Horspool	1990	提出 BM 算法的变种 Horspool 算法	字符串精确匹配BM算法的变种-Horspool算法
《Suffix arrays: A new method for on-line string searches》	Manber, Myers	1989	提出后缀数组
《File Searching Using Variable Length Keys 》	Briandais	1959	首次提出 Trie 的概念
《A New Linear-Time “On-Line” Algorithm for Finding the Smallest Initial Palindrome of a String》	Manacher	1975	提出回文子串的 Manacher 算法
《Efficient randomized pattern-matching algorithms》	Karp, Rabin	1987	提出RK算法（字符串哈希）
《Efficient String Matching An Aid to Bibliographic Search》	Aho,Corasick	1975	提出多模式匹配的 AC 自动机

计算机系统

名称	作者	年份	备注	相关文章
[《A symbolic analysis of relay and switching circuits》]	香农	1938	奠定数字电路理论基础	-
[《On the Criteria To Be Used in Decomposing Systems into Modules》]	Parnas	1972	提出模块化设计的概念，对软件工程有深远影响
《A Protocol For Packet Network Intercommunication》	Kahn	1974	奠定TCP/IP协议的基础	-
[《New Directions in Cryptography》]	Diffie	1976	提出公钥密码系统
[《A Method for Obtaining Digital Signatures and Public-Key Cryptosystems》]	Rivest	1978	提出数字签名系统和RSA算法
[《A Public-Key Cryptosystem and a Signature Scheme Based on Discrete Logarithms》]	Elgamal	1985	提出基于离散对数的公钥加密和签名方案
[《The Elliptic Curve Digital Signature Algorithm (ECDSA)》]	Johnson	1999	提出基于椭圆曲线的公钥加密
[《Error detecting and error correcting codes》]	Hamming	1950	提出汉明码，是一种信道纠错编码
[《A Method for the Construction of Minimum-Redundancy Codes》]	Huffman	1952	提出哈夫曼编码，是一种无损压缩编码
[《Arithmetic Coding for Data Compression》]	Witten	1984	提出算术编码，是一种无损压缩编码
[《Universal Algorithm for Sequential Data Compression》] 、[《Compression of Individual Sequences via Variable-Rate Coding》]	Jacob Ziv	1977、1978	提出LZ77、LZ78，这是基于字典的压缩算法，用于gzip，png等
[《A Relational Model of Data for Large Shared Data Banks》]	Codd	1970	提出关系型数据库模型
[《The entity-relationship model toward a unified view of data》]	Chen		提出实体-关系模型

数据挖掘

名称	作者	年份	备注	相关文章
[《A statistical interpretation of term specificity and its application in retrieval》]	Karen Jones	1972	提出TF-IDF
[《The PageRank Citation Ranking: Bringing Order to the Web》]	Lawrence Page	1999	提出了PageRank算法
[《The Anatomy of a Large-Scale Hypertextual Web Search Engine》]	Lawrence Page	1998	描述了Google搜索引擎的架构和算法
[《C4.5_Programs for Machine Learning》]	Quinlan	1993	提出 C4.5
[《The link prediction problem for social networks》]	Kleinberg	2003	提出社交网络中的链接预测问题
[《The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain》]	Rosenblatt	1958	提出感知机，开创人工神经网络
[《Some Methods for classification and Analysis of Multivariate Observations》]	MacQueen	1967	提出k-means的早期版本，对聚类算法产生深远影响
[《Least squares quantization in PCM》]	Lloyd	1982	提出k-means的理论基础和实现步骤，对应用起到关键作用
[《Support-Vector Networks》]	Vapnik	1995	提出SVM的理论基础和算法实现
[《Maximum Likelihood from Incomplete Data via the EM Algorithm》]	Dempster	1977	提出EM算法基本原理
[《Fast Algorithms for Mining Association Rules 》]	Agrawal	1994	提出Apriori关联规则算法
[《A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting》]	Freund	1995	提出AdaBoost算法
[《Nearest Neighbor Pattern Classification》]	Cover	1967	提出最近邻算法，奠定kNN的理论基础
[《Discriminant Adaptive Nearest Neighbor Classification and Regression 》]	Hastie	1996	提出kNN算法
[《Classification and Regression Tree》]	Breiman	1984	提出CART分类与回归树
《Idiot’s Bayes: Not So Stupid After All?》	Hand	2001	朴素贝叶斯算法
[《Mining frequent patterns without candidate generation》]	Jiawei Han	2004	提出 FP-Tree 和 FP-growth 挖掘方法
《BIRCH: an efficient data clustering method for very large databases》	ZTian	1996	提出大规模数据集高效聚类的 BIRCH 算法
《Mining Sequential Patterns: Generalizations and Performance Improvements》	Agrawal	1995	提出序列模式挖掘的 GSP 算法

AI

名称	作者	年份	备注	相关文章
[《ImageNet Classification with Deep Convolutional Neural Networks》]	Alex Krizhevsky	2012	提出深度卷积神经网络
[《You Only Look Once: Unified, Real-Time Object Detection》]	Joseph Redmon	2016	提出YOLO实时物体检测

名称	作者	年份	备注	相关文章
《Permutation importance: a corrected feature importance measure》	Andre Altmann	2010	提出特征重要性评估方法：Permutation importance	-
《Deep Residual Learning for Image Recognition》	何凯明	2015	提出ResNet	经典CNN模型-ResNet手写笔记
《Densely Connected Convolutional Networks》	黄高	2017	提出DenseNet	经典CNN模型-DenseNet手写笔记
《Memory-Efficient Implementation of DenseNets》	Geoff Pleiss	2017	提出 DenseNet 内存高效实现	经典CNN模型-DenseNet的内存高效实现
《greedy-function-approximation-GBM》	Jerome H. Friedman	1999	提出GBDT	-
《Practical-Lessons-from-Predicting-Clicks-on-Ads-at-Facebook》	Facebook	2014	提出GBDT+LR	-
《Peeking Inside the Black Box: Visualizing Statistical Learning with Plots of Individual Conditional Expectation》	Alex Goldstein	2014	提出监督学习模型的可解释性方法：个体条件期望(ICE)	-
《Explaining-the-Predictions-of-Any-Classifier》	华盛顿大学	2016	提出对分类器预测的可解释性方法：LIME	-
《Distilling the Knowledge in a Neural Network》	Hinton	2015	提出知识蒸馏概念	-
《A-Unified-Approach-to-Interpreting-Model-Predictions》	Scott M. Lundberg	2017	提出模型可解释性的 SHAP 方法	-
[《Towards A Rigorous Science of Interpretable Machine Learning》]	Doshi-Velez∗	2017	提出一套评估模型可解释性的严格方法	-
《Playing-Atari-with-Deep-Reinforcement-Learning.》	Volodymyr Mnih	2013	提出 DQN	-
《Forecasting-at-Scale》	Facebook	2017	提出时间序列预测框架 prophet	-