Stephen stopped his referral system, so I had to set one up myself. It helps not only me but others as well. I fixed some bugs and configured it for easy use: add such a line to your webpage, and it will show who has linked to your website in the last 24 hours. It is just a referral system; it still has bugs, and the spam referrals still need fixing....
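A minimal sketch of the "last 24 hours" part of such a referral system, assuming each hit is stored as a (timestamp, referrer URL) pair. The function name `recent_referrers` and the data layout are hypothetical, not taken from Stephen's system or from mine, and spam filtering is left out:

```python
from datetime import datetime, timedelta


def recent_referrers(hits, now, window=timedelta(hours=24)):
    """Return distinct referrer URLs seen within `window` before `now`, newest first.

    `hits` is an iterable of (timestamp, referrer_url) tuples (assumed layout).
    """
    cutoff = now - window
    seen = []
    # Walk the hits from newest to oldest so the newest referrer comes first.
    for timestamp, referrer in sorted(hits, reverse=True):
        if timestamp >= cutoff and referrer not in seen:
            seen.append(referrer)
    return seen
```

Passing `now` in explicitly, rather than reading the clock inside the function, keeps the sketch easy to check by hand.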
News Item...
Speakers: Liang Lu, Kevin Wen, Zhihong Mao. Abstract: The blogging concept was first introduced in China in 2002. It grew rapidly and became popular in merely one year. In 2003, Social Networking and related applications started to surface in China. There are more than 300,000 people who are actively involved in Blogging and Social Networking sites. These new methods of self-expression and communication have started a new Internet trend in China. People are publishing their thoughts and experiences on Blog sites. Social Networking tools help them to establish new connections and enhance existing friendships. Currently, blogging service providers are seeking new opportunities to commercialize their businesses. At the same time, they are adjusting their strategy to comply with evolving government regulations on Internet usage. Social Networking service providers are striving to find a profitable business model. What if we combine these two seemingly separate yet fundamentally related concepts? Blogging and Social Networking complement each other. Their value can potentially be multiplied through integration. Blogging improves the interaction of Social Networking users. Blogger communities become tighter and stronger via Social Networking. Naturally, this new breed of collaboration and communication application evolves into a personal portal. Refer: http://www.blogdriver.com http://www.uufriends.com...
Before answering this question, we need to ask a few others: 1] Will the government ban blogs? No. As long as the internet exists, the Chinese government will have to let blogs exist. The main reason is that the risk posed by blogs does not exceed the risk posed by BBS, another style of internet application for groups of users. 2] What is the bottom line? Sex? Pornography? No. As far as I can see, many bloggers post porn on their blogs and are still ok, like the famous bloggers Muzimei and ZhuMuqingtong. Politically sensitive content is the bottom line, and other parts of the CCP's history are also sensitive topics. These sensitive things will kill a blog and even the whole blog hosting service. So, where to go? Go somewhere that is not politically related. Too simple? Too naive? Something simple? No, I mean it should go like this. I mean what I say, no more, no less....
On 3-13-2004, the 3rd largest blog service, BLOGBUS, was shut down by the Chinese government. One day later, the largest blog service, BLOGCN, was also shut down by the government. Another day later, the 2nd largest blog service, BLOGDRIVER, posted a notice on its main page: "Maintenance for unknown situation". Although BLOGDRIVER resumed its service one day later, the other 2 blog services are still down, and almost 180000 bloggers cannot access their blogs and do not know what happened. Last weekend, I had a phone meeting with the managers of these 3 blog services and other bloggers. Hengge, manager of Blogbus: We don't know how long it will take, but we have to clean up all these sensitive posts. Huzhiguang, manager of Blogcn: We got a fax from the government which says that we must shut down our servers. Rever, manager of Blogdriver: Still ok, for now. We don't have these sensitive posts and we have a filtering program. Thanks to all the friends who let us be heard....
Google searches for an item, while a differential search engine searches for the difference between two pieces of information. Example: the A-->B relation is 50%, the B-->C relation is 40%, and there is no chain connecting A-->C directly; the only connection from A---->C is A-->B-->C, so the relation of A---->C is 50%*40%=20%. An engine that calculates the relationships A-->B, B-->C and A---->C is a relation search engine. Then what is a diff-engine? A differential search engine calculates the differences B-A and B-C, which is useful when you want to know the difference between very similar pieces of information. For example, if you go to www.froogle.com to check something, you may compare the price, but anything else? Sure, you also want to compare the color, the functions, the features..... many things need comparing. So, now you know one thing a differential search engine can do? :-D It can do more. I am busy designing it; hopefully I can release the full engine in the middle of this year. Liang 3-10-2004...
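The two ideas above can be sketched in a few lines of Python. This is only an illustration under assumptions, not the engine I am designing: `chain_relation` multiplies the relation strengths along one chain (the 50%*40% example), and `attribute_diff` is a hypothetical stand-in for "difference", comparing two items described as attribute dictionaries.

```python
from math import prod


def chain_relation(strengths):
    """Multiply relation strengths along a single chain, e.g. A-->B-->C."""
    return prod(strengths)


def attribute_diff(item_a, item_b):
    """Return {attribute: (value_in_a, value_in_b)} for attributes that differ.

    An attribute missing from one item shows up with None on that side.
    """
    keys = sorted(set(item_a) | set(item_b))
    return {
        key: (item_a.get(key), item_b.get(key))
        for key in keys
        if item_a.get(key) != item_b.get(key)
    }


# A-->B is 50% and B-->C is 40%, so A---->C through B is 20%.
a_to_c = chain_relation([0.5, 0.4])

# Two similar products (made-up data): only the differing attributes remain.
diff = attribute_diff(
    {"price": 99, "color": "black", "zoom": "3x"},
    {"price": 120, "color": "black", "zoom": "4x"},
)
```

Here `diff` keeps `price` and `zoom` and drops `color`, which is the kind of comparison a price-only site does not give you.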
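Just wrote a shell script which automatically finds the top 600 music mp3 links and downloads them one by one. Here it is:
#!/bin/sh
# By Liang Lu at 3-9-2004
# Start from clean lists; -f avoids an error when they do not exist yet.
rm -f mp3.list html.list
# Fetch the chart page and keep every quoted string ending in "htm".
wget http://list.mp3.baidu.com/topso/mp3topsong.html -O mp3topsong.html
tr '"' '\n' < mp3topsong.html | grep 'htm$' > html.list
# Fetch each song page and record its first http link ending in "mp3".
CC=1
for VAL in $(cat html.list)
do
    wget "http://list.mp3.baidu.com/topso/$VAL" -O "$CC.html"
    tr '"' '\n' < "$CC.html" | grep 'mp3$' | grep http | head -n 1 >> mp3.list
    CC=$((CC + 1))
done
# Download the collected mp3 links one by one.
CC=1
for VAL in $(cat mp3.list)
do
    echo "$CC"
    wget "$VAL" -O "$CC.mp3"
    CC=$((CC + 1))
done
...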
Today I launched Wikicn.com, which I will turn into a multi-user wiki: users can register to get their own wiki, start a project, and administer their wiki as well. wikicn.com runs on a Celeron 633 machine with 64M memory running Linux, pretty slow :( Thanks to Leafdown and Nio...
Back from Austin (Fifth Annual ASIS&T Information Architecture Summit 2004), a very impressive meeting. There was one talk, [Using Machine Learning Techniques to Populate Dynamic Interfaces], which was about information clustering. This is exactly what I am doing now. But I am not using a clustering method; I am using an NN (neural network) to auto-discover the information in documents. After the meeting, I talked with the speaker, Miles Efron, a PostDoc at the University of North Carolina, and he was impressed with what I was doing. But what concerns me is that the systemic error of the NN is too large. Here is what I have done in the last few months: 1] What is the information of one document? Category, keywords, description, metadata. The category is very difficult to determine if the document itself was not assigned to a category when it was published. So my task is: to help achieve accurate search results, I need to assign every document to a certain category. 2] How to use a neural network to auto-categorize? (1) I have around 5000 documents whose categories are already known, divided into 15 categories: Novel(ID=1), money(ID=2), study(ID=3), social(ID=4)..... I put all {keywords, descriptions, metadata, contents} into the NN and tell the NN which of the 15 categories they belong to. My NN is an MLP (Multi-Layered Perceptron) with 2 layers and 72 neurons. After training 600 times with optimal steps, my NN is pretty stable, with outputs 1(Novel), 2(money),..... This means my NN can recognize these documents and already knows which categories they belong to. 3] Verification. The NN may be overtrained, which can cause a very large systemic error. I tested it with 4500 new documents (with already known categories) which my NN had never seen. The result is consistent with [2] (the training result).
4] Testing. Now, I used Google to download 3000 documents and passed them through my NN. For each document, the NN gives the probability of it belonging to each of the 15 categories. Example: for this document, the result is: Technology: 79%, Error: 10%; Biology: 15%, Error: 4%; ..... ..... Social: 0%, Error: 0%; Sports: 0%, Error: 0%. The overall systemic error is 14%. So this post will automatically be assigned to the "Technology" category by my NN machine. BTW, an NN can also generate the keywords and description, which is the real IA (Information Architecture) I will focus on when I launch my search engine....
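The last step, turning the per-category probabilities into one assigned category, can be sketched as below. This is a minimal illustration, assuming the NN output has already been collected into a dictionary; the name `assign_category` is hypothetical, and the numbers are the ones from the example above (the elided categories are simply left out).

```python
def assign_category(probabilities):
    """Return the category with the highest probability.

    `probabilities` maps category name -> probability (0.0 to 1.0).
    """
    if not probabilities:
        raise ValueError("no category probabilities given")
    return max(probabilities, key=probabilities.get)


# Probabilities from the example document; other categories are omitted.
example_output = {
    "Technology": 0.79,
    "Biology": 0.15,
    "Social": 0.0,
    "Sports": 0.0,
}
category = assign_category(example_output)
```

A real system would probably also want a minimum probability below which a document stays unassigned, but that threshold is not something I have settled here.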
Using rss/xml feeds to build a search engine is a shortcut if you just want a search engine. A simple one that I will build in the future: Larbin (crawl)-->all links-->sort and find rss-->crawl rss-->paste it into a database-->search API-->Done. Excite News...
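The tail end of that pipeline (rss-->database-->search) might look roughly like this in Python. It is a sketch under assumptions: the crawl step (Larbin) is skipped and replaced by a small made-up feed, the table layout and the function names `parse_rss`, `build_index` and `search` are hypothetical, and the "search API" is just a SQL LIKE query on an in-memory SQLite database.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Made-up feed standing in for what the crawler would fetch.
SAMPLE_FEED = """<rss version="2.0"><channel><title>Example feed</title>
<item><title>Blogging in China</title><link>http://example.com/1</link>
<description>Notes on blog services</description></item>
<item><title>Wiki launch</title><link>http://example.com/2</link>
<description>A multi-user wiki</description></item>
</channel></rss>"""


def parse_rss(xml_text):
    """Extract (title, link, description) tuples from an RSS 2.0 document."""
    root = ET.fromstring(xml_text)
    return [
        (
            item.findtext("title", default=""),
            item.findtext("link", default=""),
            item.findtext("description", default=""),
        )
        for item in root.iter("item")
    ]


def build_index(items):
    """Store the parsed items in an in-memory SQLite table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE posts (title TEXT, link TEXT, description TEXT)")
    conn.executemany("INSERT INTO posts VALUES (?, ?, ?)", items)
    conn.commit()
    return conn


def search(conn, keyword):
    """Return (title, link) rows whose title or description contains keyword."""
    pattern = f"%{keyword}%"
    rows = conn.execute(
        "SELECT title, link FROM posts "
        "WHERE title LIKE ? OR description LIKE ? ORDER BY title",
        (pattern, pattern),
    )
    return rows.fetchall()
```

Usage would be `conn = build_index(parse_rss(SAMPLE_FEED))` followed by `search(conn, "wiki")`. LIKE matching is crude; a real engine would want proper full-text indexing, but this shows why feeds are a shortcut: the title, link and description arrive already separated.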